


Applied mathematics comes down ultimately 
to numerical results, and the student of any 
branch of applied mathematics will do well 
to supplement his usual mathematical equip- 
ment with a definite knowledge of the 
numerical side of mathematical analysis. 

From the Preface to the First
Edition, 1930

For more than 30 years Numerical Mathematical
Analysis has been the outstanding
textbook in its field. Each successive edition
has improved on those preceding it, correct- 
ing, clarifying, adding new material, explain- 
ing new techniques. 

This enlarged sixth edition includes the 
following significant additions:

The Trapezoidal Rule

An additional method for finding the 
roots of numerical equations 

Formulas and computing schedules for
the numerical solution of simultaneous
differential equations of the second order
by the Runge-Kutta method.

Numerical Mathematical Analysis sets
forth clearly and systematically the most important
principles, methods, and processes
used for obtaining numerical results. It also
describes methods and means for estimating
the accuracy of such results. Among the

( continued on back flap ) 


Original U. S. Edition

Indian Edition 


S K.50 
Rs. 13.00 



NUMERICAL MATHEMATICAL 
ANALYSIS 






BY 

JAMES B. SCARBOROUGH, Ph. D. 



OXFORD & IBH PUBLISHING CO. 

Calcutta Bombay New Delhi 



PUBLISHED BY OXFORD & IBH PUBLISHING CO., CALCUTTA, AND PRINTED IN CALCUTTA



To the memory
of my son 

JAMES BLAINE SCARBOROUGH, JR.




PREFACE TO THE FIRST EDITION 


Applied mathematics comes down ultimately to numerical results, and 
the student of any branch of applied mathematics will do well to supple- 
ment his usual mathematical equipment with a definite knowledge of the 
numerical side of mathematical analysis. He should, in particular, be able 
to estimate the reliability of any numerical result he may arrive at. The 
object of this book is to set forth in a systematic manner and as clearly as 
possible the most important principles, methods, and processes used for 
obtaining numerical results; and also methods and means for estimating 
the accuracy of such results. The book is concerned only with fundamental 
principles and processes, and is not a treatise on computation. For this 
reason little attention is paid to computation forms, the assumption being 
that the reader who has much computation of a particular kind to do will 
be able to devise his own form. 

The plan of treatment followed throughout the book may be briefly 
stated as follows: Each major subject or topic is introduced by a short 
statement of “what it is all about.” Then follows a brief statement of the
underlying theory of the subject under consideration. With this theory 
as a basis, the processes and formulas are then developed in the simplest 
and most direct manner. Formulas and methods for checking or esti- 
mating the accuracy of results are also worked out wherever possible. The 
reader is then shown just how to use the formulas and processes developed, 
by applying them to a variety of examples. Finally, the limitations of the 
formulas and the pitfalls connected with the processes are carefully pointed 
out by means of appropriate examples. Notes and remarks are also added 
wherever they will throw further light on the subjects under consideration. 

The treatment of all topics has been made as elementary as was consistent
with soundness, and in some instances the explanations may seem
unnecessarily detailed. For such detailed explanations no apology is 
offered, as the book is meant to be understood with a minimum of effort 
on the part of the reader. Moreover, experience in teaching certain topics 
has shown that even a good student must receive considerable assistance 
from teacher, textbook, or some other source. I have tried everywhere to
clear up the difficulties before the student meets them, so that no teacher
or other source of information will be needed. In order to make the book
everywhere as readable as possible I have purposely refrained from using
notations peculiar to certain subjects, and from employing symbolic methods 
and divided differences in deriving the standard formulas of interpolation. 


A knowledge of calculus to the extent of the usual first course is all that 
is needed for the understanding of anything in the book. 

The more important formulas throughout the book are numbered in 
heavy black type to distinguish them from those of less importance. 

The worker who is to obtain numerical results with a minimum of 
effort must provide himself with every possible aid for lessening the labor 
of his task. In addition to such aids as slide rules, computing machines, 
and logarithmic tables, the computer will find that Barlow’s tables of 
squares, cubes, etc., and the Smithsonian Mathematical Tables are practically
indispensable. Crelle’s “Calculating Tables,” Jahnke and Emde’s
“Funktionentafeln,” and Jordan’s “Opus Palatinum” (tables of natural
sines and cosines to seven decimal places) will also prove their worth in 
many instances. 

In the preparation of the book I have consulted the writings of the 
majority of previous writers on the subjects treated, and am indebted to 
many of them for ideas and methods; but my greatest debt is to the 
writings of the late and great Carl Runge, who undoubtedly contributed
more to numerical mathematical analysis than any other man since Gauss.
References to the works of other writers will be found here and there in
the text and in footnotes. 

It is a pleasure to record my thanks to the U. S. Naval Institute for 
permission to use certain copyrighted material which I originally prepared 
for Engineering Mathematics; to Dr. L. M. Kells, of the
U. S. Naval Academy, for helpful criticism on parts of the manuscript;
and to the Johns Hopkins Press and the George Banta Publishing Company 
for their hearty cooperation in meeting my wishes concerning the make-up 
and publication of the book. 


J. B. SCARBOROUGH



PREFACE TO THE SECOND EDITION 


In this revision all known errors and misprints in the first edition have 
been corrected. A considerable amount of new material has been added, 
and a small amount of material in the previous edition has been left out. 

The chapter on numerical integration in the original edition has been 
rewritten and augmented to a considerable extent. All the material dealing 
with the numerical solution of ordinary differential equations has been 
completely recast and augmented in various directions. Much more atten- 
tion has been given to methods of starting the solutions, and all the best 
methods for that purpose have been treated in detail. 

The major part of the new material consists of a section on the accuracy 
of the solutions of systems of linear equations, the new material dealing 
with numerical integration, the new material dealing with the numerical 
solution of ordinary differential equations, including the derivation of the 
equations of exterior ballistics, a rather lengthy chapter on the numerical 
solution of partial differential equations, and a shorter chapter on the 
numerical solution of integral equations. Other new material in smaller 
amounts has been added in various places. 

In all new material, as well as in the old, an effort has been made to 
make the treatment unmistakably clear and understandable everywhere 
and in all respects. Although the utmost clarity was aimed at in the first 
edition, clarity has received even more attention in this revision. 

The exercises at the ends of the chapters have been changed and aug- 
mented to some extent, and the answers to the majority of them are given. 

During the past fifteen years the computer has been provided with great 
and revolutionary aids. The many volumes of W. P. A. Tables, sponsored
by the National Bureau of Standards, have met a real need of long 
standing; the great automatic calculating machines have performed with 
ease and rapidity many calculations that were prohibitive in labor and time 
by the older hand methods and hand machines, and they have turned out 
volumes of tables in a matter of weeks; and, finally, the important journal
Mathematical Tables and Other Aids to Computation serves as a clearing
house in matters of computation and enables the computer to keep up
with progress in computation throughout the world. The computer will
be wise to make use of these aids whenever possible. It seems no exaggeration
to say that during no other fifteen-year period in the world’s history
have such great strides been made in the art of getting numerical results.
May the strides continue!

I wish here to thank those readers in various parts of the world who
have kindly pointed out errors and misprints in the first edition of this
work. I shall be grateful to future readers who may notify me of errors 
or misprints in the present edition. 

It is a pleasure to record my thanks to Professors A. E. Currier and 
S. S. Saslaw for putting at my disposal their unusual knowledge of mathe- 
matical analysis, and to Professor J. M. Holme for his excellent work in 
drawing the figures in their final form.

Finally, I wish to extend my thanks to the Johns Hopkins Press and 
the J. H. Furst Company for their hearty cooperation in meeting my 
wishes relative to the make-up of the book. 

J. B. SCARBOROUGH

PREFACE TO THE THIRD EDITION 

This edition is mostly an enlargement of the previous edition. The new 
material consists mainly of an article on the errors in determinants and 
a chapter on the numerical solution of simultaneous linear equations. 
All known misprints and errors in the second edition have been corrected 
and a few other minor improvements have been made. 

In recent years the numerical solution of systems of linear equations 
has become a subject of major importance, due mainly to the widespread
use of automatic computing machines. In the new chapter of this book 
several of the best methods of solving such equations have been treated 
in detail and illustrated by numerical examples. An attempt has been 
made to make the treatment as clear and direct as possible. 

Here I wish to record my thanks and indebtedness to Dr. Morris Newman, 
of the National Bureau of Standards, for bringing to my attention the 
method of inverting matrices used at the Bureau of Standards and for 
explaining certain points connected with the method. I also wish to thank
all readers who have kindly brought to my attention some of the errors 
and misprints in the second edition. 

Finally, I wish to record my thanks to the Director of the Johns Hopkins 
Press for his interest, encouragement, and cooperation in bringing out 
this edition. 

J. B. S. 


PREFACE TO THE FOURTH EDITION 

In preparing this edition I have made major changes and additions in
Chapters X, XI, and XVI of the third edition and have also made minor 
changes and additions in other parts of that edition. 





An article on the convergence of the Newton-Raphson method has been
added in Chapter IX. In Chapter X, Brodetsky and Smeal’s perfection
of Graeffe’s method has been explained in detail, somewhat simplified and 
modified, and illustrated by two worked examples. An article on improving 
the accuracy of complex roots found by the Graeffe method has also been 
added. 

A few important changes and additions have been made in Chapter XI. 
In Chapter XVI a section on smoothing experimental data has been added, 
with illustrative examples and exercises. 

All known misprints and errors in the third edition have been corrected. 

J.B.S. 


PREFACE TO THE FIFTH EDITION

In this edition the material of the fourth edition has been augmented 
by the addition of new material and rearranged in places to improve logical 
order. The most important additions are a chapter on interpolation with 
unequal intervals of the argument by means of Newton’s general formula 
of interpolation, the derivation of all central-difference interpolation for- 
mulas by means of divided differences, and methods of investigating the 
errors in the solutions of single equations and systems of linear equations 
when the coefficients are subject to errors. Several minor additions and 
changes have also been made. 

Some of the exercises in the fourth edition have been changed and new 
ones added. Answers are given to all exercises except those on differential 
equations. All known errors and misprints in the fourth edition have been 
corrected. 

J.B.S. 


PREFACE TO THE SIXTH EDITION 

This edition is a slight enlargement of the fifth edition, made by the 
addition of the Trapezoidal Rule, an additional method for finding the 
roots of numerical equations, and formulas and computing schedules for 
the numerical solution of simultaneous differential equations of the second 
order by the Runge-Kutta method. 


J.B.S. 




CONTENTS 


CHAPTER I

THE ACCURACY OF APPROXIMATE CALCULATIONS

1. Introduction
2. Approximate Numbers and Significant Figures
3. Rounding of Numbers
4. Absolute, Relative, and Percentage Errors
5. Relation between Relative Error and the Number of Significant Figures
6. The General Formula for Errors
7. Application of the Error Formulas to the Fundamental Operations of Arithmetic and to Logarithms
8. The Impossibility, in General, of Obtaining a Result More Accurate than the Data Used
9. Further Considerations on the Accuracy of a Computed Result
10. Accuracy in the Evaluation of a Formula or Complex Expression
11. Accuracy in the Determination of Arguments from a Tabulated Function
12. Accuracy of Series Approximations
13. Errors in Determinants
14. A Final Remark
Exercises I

CHAPTER II

INTERPOLATION

DIFFERENCES. NEWTON’S FORMULAS OF INTERPOLATION

15. Introduction
16. Differences
17. Effect of an Error in a Tabular Value
18. Relation between Differences and Derivatives
19. Differences of a Polynomial
20. Newton’s Formula for Forward Interpolation
21. Newton’s Formula for Backward Interpolation
Exercises II

CHAPTER III

INTERPOLATION WITH UNEQUAL INTERVALS
OF THE ARGUMENT

22. Divided Differences
23. Tables of Divided Differences
24. Symmetry of Divided Differences
25. Relation between Divided Differences and Simple Differences
26. Newton’s General Interpolation Formula
27. Lagrange’s Interpolation Formula
Exercises III


CHAPTER IV

CENTRAL-DIFFERENCE INTERPOLATION FORMULAS

28. Introduction
29. Gauss’s Central-Difference Formulas
30. Stirling’s Interpolation Formula
31. Bessel’s Interpolation Formulas
Exercises IV


CHAPTER V

INVERSE INTERPOLATION

32. Definition
33. By Lagrange’s Formula
34. By Successive Approximations
35. By Reversion of Series
Exercises V


CHAPTER VI

THE ACCURACY OF INTERPOLATION FORMULAS

36. Introduction
37. Remainder Term in Newton’s Formula (I) and in Lagrange’s Formula
38. Remainder Term in Newton’s Formula (II)
39. Remainder Term in Stirling’s Formula
40. Remainder Terms in Bessel’s Formulas
41. Recapitulation of Formulas for the Remainder
42. Accuracy of Linear Interpolation from Tables
Exercises VI


CHAPTER VII

INTERPOLATION WITH TWO INDEPENDENT VARIABLES

TRIGONOMETRIC INTERPOLATION

43. Introduction
44. Double Interpolation by a Double Application of Single Interpolation
45. Double or Two-Way Differences
46. A General Formula for Double Interpolation
47. Trigonometric Interpolation
Exercises VII


CHAPTER VIII

NUMERICAL DIFFERENTIATION AND INTEGRATION

I. NUMERICAL DIFFERENTIATION

48. Numerical Differentiation

II. NUMERICAL INTEGRATION

49. Introduction
50. A General Quadrature Formula for Equidistant Ordinates
51. Simpson’s Rule
52. Weddle’s Rule
52A. The Trapezoidal Rule
53. Central-Difference Quadrature Formulas
54. Gauss’s Quadrature Formula
55. Lobatto’s Formula
56. Tchebycheff’s Formula
57. Euler’s Formula of Summation and Quadrature
58. Caution in the Use of Quadrature Formulas
59. Mechanical Cubature
60. Prismoids and the Prismoidal Formula
Exercises VIII



CHAPTER IX

THE ACCURACY OF QUADRATURE FORMULAS

61. Introduction
62. Formulas for the Inherent Error in Simpson’s Rule
63. The Inherent Error in Weddle’s Rule
64. The Remainder Terms in Central-Difference Formulas (53.1) and (53.3)
65. The Inherent Errors in the Formulas of Gauss, Lobatto, and Tchebycheff
66. The Remainder Term in Euler’s Formula
Exercises IX

CHAPTER X

THE SOLUTION OF NUMERICAL ALGEBRAIC AND
TRANSCENDENTAL EQUATIONS

I. EQUATIONS IN ONE UNKNOWN

67. Introduction
68. Finding Approximate Values of the Roots
68A. Finding Roots by Repeated Application of Location Theorem
69. The Method of Interpolation, or of False Position (Regula Falsi)
70. Solution by Repeated Plotting on a Larger Scale
71. The Newton-Raphson Method
72. Geometric Significance of the Newton-Raphson Method
73. The Inherent Error in the Newton-Raphson Method
74. A Special Procedure for Algebraic Equations
75. The Method of Iteration
76. Geometry of the Iteration Process
77. Convergence of the Iteration Process
78. Convergence of the Newton-Raphson Method
79. Errors in the Roots due to Errors in the Coefficients and Constant Term

II. SIMULTANEOUS EQUATIONS IN SEVERAL UNKNOWNS

80. The Newton-Raphson Method for Simultaneous Equations
81. The Method of Iteration for Simultaneous Equations
82. Convergence of the Iteration Process in the Case of Several Unknowns
Exercises X



CHAPTER XI

GRAEFFE’S ROOT-SQUARING METHOD FOR SOLVING
ALGEBRAIC EQUATIONS

83. Introduction
84. Principle of the Method
85. The Root-Squaring Process
86. Case I. Roots Real and Unequal
87. A Check on the Coefficients in the Root-Squared Equation
88. Case II. Complex Roots
89. Case III. Roots Real and Numerically Equal
90. Brodetsky and Smeal’s Improvement of Graeffe’s Method
91. Improving the Accuracy of the Roots
Exercises XI

CHAPTER XII

NUMERICAL SOLUTION OF SIMULTANEOUS LINEAR
EQUATIONS

I. SOLUTION BY DETERMINANTS

92. Evaluation of Numerical Determinants
93. Cramer’s Rule

II. SOLUTION BY SUCCESSIVE ELIMINATION OF THE UNKNOWNS

94. The Method of Division by the Leading Coefficients
95. The Method of Gauss
96. Another Version of the Gauss Method

III. SOLUTION BY INVERSION OF MATRICES

97. Definitions
98. Addition and Subtraction of Matrices
99. Multiplication of Matrices
100. Inversion of Matrices
101. Solution of Equations by Matrix Methods

IV. SOLUTION BY ITERATION

102. Systems Solvable by Iteration
103. Conditions for the Convergence of the Iteration Process
104. Errors in the Solutions when the Coefficients and Constant Terms are Subject to Errors
Exercises XII

CHAPTER XIII

THE NUMERICAL SOLUTION OF ORDINARY
DIFFERENTIAL EQUATIONS

I. EQUATIONS OF THE FIRST ORDER

105. Introduction
106. Euler’s Method and Its Modification
107. Picard’s Method of Successive Approximations
108. Use of Approximating Polynomials
109. Methods of Starting the Solution
110. Halving the Interval for h
Exercises XIII

II. EQUATIONS OF THE SECOND ORDER AND SYSTEMS
OF SIMULTANEOUS EQUATIONS

111. Equations of the Second Order
112. Second-Order Equations with First Derivative Absent
113. Systems of Simultaneous Equations
114. Conditions for Convergence

III. OTHER METHODS OF SOLVING DIFFERENTIAL
EQUATIONS NUMERICALLY

115. Milne’s Method
116. The Runge-Kutta Method
117. Checks, Errors, and Accuracy
118. Some General Remarks
Exercises XIV

IV. THE DIFFERENTIAL EQUATIONS OF EXTERIOR BALLISTICS

119. The Simplest Case — Flat Earth with Constant Acceleration of Gravity
120. The General Case, Allowing for Variation in Air Density with Altitude
121. Methods of Finding the Starting Values
Exercises XV



CHAPTER XIV

THE NUMERICAL SOLUTION OF PARTIAL
DIFFERENTIAL EQUATIONS

122. Introduction

I. DIFFERENCE QUOTIENTS AND DIFFERENCE EQUATIONS

123. Difference Quotients
124. Difference Equations

II. THE METHOD OF ITERATION

125. Solution of Difference Equations by Iteration
126. The Inherent Error in the Solution by Difference Equations
127. Applications of Conformal Transformation to Certain Problems

III. THE METHOD OF RELAXATION

128. Solution of Difference Equations by Relaxation
129. Triangular Networks
130. Block Relaxation
131. The Iteration and Relaxation Methods Compared

IV. THE RAYLEIGH-RITZ METHOD

132. Introduction
133. The Vibrating String
134. Vibration of a Rectangular Membrane
135. Comments on the Three Methods

CHAPTER XV

THE NUMERICAL SOLUTION OF INTEGRAL
EQUATIONS

136. Integral Equations — Definitions
137. Boundary-Value Problems of Ordinary Differential Equations. Green’s Functions
138. Linear Integral Equations
139. Non-Linear Integral Equations and Boundary-Value Problems





CHAPTER XVI

THE NORMAL LAW OF ERROR AND THE PRINCIPLE
OF LEAST SQUARES

140. Errors of Observations and Measurements
141. The Law of Accidental Errors
142. The Probability of Errors Lying between Given Limits
143. The Probability Equation
144. The Law of Error of a Linear Function of Independent Quantities
145. The Probability Integral and Its Evaluation
146. The Probability of Hitting a Target
147. The Principle of Least Squares
148. Weighted Observations
149. Residuals
150. The Most Probable Value of a Set of Direct Measurements
151. Law of Error for Residuals
152. Agreement between Theory and Experience
Exercises XVI

CHAPTER XVII

THE PRECISION OF MEASUREMENTS

153. Measurement, Direct and Indirect
154. Precision and Accuracy

I. DIRECT MEASUREMENTS

155. Measures of Precision
156. Relations between the Precision Measures
157. Geometric Significance of μ, r, and η
158. Relation between Probable Error and Weight, and the Probable Error of the Arithmetic and Weighted Means
159. Computation of the Precision Measures from the Residuals
160. The Combination of Sets of Measurements When the P.E.’s of Sets Are Given
Exercises XVII

II. INDIRECT MEASUREMENTS

161. The Probable Error of any Function of Independent Quantities Whose P.E.’s are Known
162. The Two Fundamental Problems of Indirect Measurements
163. Rejection of Observations and Measurements
Exercises XVIII

CHAPTER XVIII

EMPIRICAL FORMULAS

164. Introduction
165. The Graphic Method, or Method of Selected Points
166. The Method of Averages
167. The Method of Least Squares
168. Weighted Residuals
169. Non-Linear Formulas — The General Case
170. Determination of the Constants when Both Variables Are Subject to Error
171. Finding the Best Type of Formula
172. Smoothing of Observational and Experimental Data
Exercises XIX

CHAPTER XIX

HARMONIC ANALYSIS OF EMPIRICAL FUNCTIONS

173. Introduction
174. Case of 12 Ordinates
175. Case of 24 Ordinates
176. Periods other than 2π
Exercises XX

Appendix: Tables of Values of Probability Integral
Index
Answers to Exercises




NUMERICAL MATHEMATICAL 
ANALYSIS 


CHAPTER I

THE ACCURACY OF APPROXIMATE CALCULATIONS 

1. Introduction. Since applied mathematics comes down ultimately to 
numerical results, the worker in applied mathematics will encounter all 
kinds of numbers and all kinds of formulas. He must be able to use the 
numbers and evaluate the formulas so as to get the best possible result in
any situation. What he learned about numerical calculation in his earlier 
study of arithmetic is inadequate for handling the numerical side of applied 
mathematics. For example, the numerical data used in solving the prob- 
lems of everyday life are usually not exact, and the numbers expressing 
such data are therefore not exact. They are merely approximations, true
to two, three, or more figures. 

Not only are the data of practical problems usually approximate, but
sometimes the methods and processes by which the desired result is to 
be found are also approximate. An approximate calculation is one which 
involves approximate data, approximate methods, or both. 

It is therefore evident that the error in a computed result may be due 
to one or both of two sources : errors in the data and errors of calculation. 
Errors of the first type cannot be remedied, but those of the second type 
can usually be made as small as we please. Thus, when such a number as 
π is replaced by its approximate value in a computation, we can decrease
the error due to the approximation by taking π to as many figures as
desired, and similarly in most other cases. We shall therefore assume in
this chapter that the calculations are always carried out in such a manner
as to make the errors of calculation negligible.

Nearly all numerical calculations are in some way approximate, and the
aim of the computer should be to obtain results consistent with the data 
with a minimum of labor. The object of the present chapter is to set 
forth some basic ideas and methods relating to approximate calculations 
and to give methods for estimating the accuracy of the results obtained. 




2. Approximate Numbers and Significant Figures.

(a) Approximate Numbers. In the discussion of approximate computation,
it is convenient to make a distinction between numbers which are
absolutely exact and those which express approximate values. Such
numbers as 2, 1/3, 100, etc. are exact numbers because there is no
approximation or uncertainty associated with them. Although such numbers as
π, √2, e, etc. are exact numbers, they cannot be expressed exactly by a
finite number of digits. When expressed in digital form, they must be 
written as 3.1416, 1.4142, 2.7183, etc. Such numbers are therefore only 
approximations to the true values and in such cases are called approximate 
numbers. An approximate number is therefore defined as a number which 
is used as an approximation to an exact number and differs only slightly 
from the exact number for which it stands.* 

(b) Significant Figures. A significant figure is any one of the digits 
1, 2, 3, ..., 9; and 0 is a significant figure except when it is used to fix
the decimal point or to fill the places of unknown or discarded digits. 
Thus, in the number 0.00263 the significant figures are 2, 6, 3; the zeros 
are used merely to fix the decimal point and are therefore not significant. 
In the number 3809, however, all the digits, including the zero, are signifi- 
cant figures. In a number like 46300 there is nothing in the number as 
written to show whether or not the zeros are significant figures. The 
ambiguity can be removed by writing the number in the powers-of-ten 
notation as 4.63 × 10⁴, 4.630 × 10⁴, or 4.6300 × 10⁴, the number of
significant figures being indicated by the factor at the left. 
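
The distinctions drawn in this article can be checked mechanically. What follows is a minimal Python sketch and not part of the original text. The function names `significant_figures` and `powers_of_ten` are hypothetical, and the sketch assumes that the number is supplied as a decimal string, so that every zero written after the first nonzero digit is treated as significant.

```python
from decimal import Decimal


def significant_figures(text: str) -> int:
    """Count the significant figures of a number written as a decimal string."""
    # Decimal discards leading zeros (they merely fix the decimal point)
    # but keeps every digit written after the first nonzero one.
    return len(Decimal(text).as_tuple().digits)


def powers_of_ten(text: str, figures: int) -> str:
    """Write a number in powers-of-ten notation with the stated number of figures."""
    d = Decimal(text)
    exponent = d.adjusted()            # power of ten of the leading digit
    mantissa = d.scaleb(-exponent)     # shift the point behind the leading digit
    # Keep exactly `figures` digits in the mantissa (assumed: no carry into
    # a new leading digit, which holds for the examples below).
    mantissa = mantissa.quantize(Decimal(1).scaleb(1 - figures))
    return f"{mantissa} x 10^{exponent}"


print(significant_figures("0.00263"))   # zeros only fix the decimal point
print(significant_figures("3809"))      # the interior zero counts
for figures in (3, 4, 5):
    print(powers_of_ten("46300", figures))
```

The loop prints the three unambiguous forms of 46300 discussed above; which of them is the right one cannot be decided from the digits alone and must come from knowledge of the data.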

3. Rounding of Numbers. If we attempt to divide 27 by 13.1, we get

27/13.1 = 2.061068702...,

a quotient which never terminates. In order to use such a number in a
practical computation, we must cut it down to a manageable form, such
as 2.06, or 2.061, or 2.06107, etc. This process of cutting off superfluous
digits and retaining as many as desired is called rounding off.

To round off or simply round a number is to retain a certain number 
of digits, counted from the left, and drop the others. Thus, to round off π
to three, four, five, and six figures, respectively, we have 3.14, 3.142,
3.1416, 3.14159. Numbers are rounded off so as to cause the least possible
error. This is attained by rounding according to the following rule:

* Some readers may object to the term “approximate number” and insist that one
should always say “approximate value” of a number. The shorter term, however,
is less cumbrous, is perfectly definite as defined above, and reminds us by its very
name that it stands for the approximate value of a number. It has been used in
this sense by no less an authority than Jules Tannery in his Leçons d’Arithmétique.





To round off a number to n significant figures, discard all digits to the
right of the nth place. If the discarded number is less than half a unit
in the nth place, leave the nth digit unchanged; if the discarded number
is greater than half a unit in the nth place, add 1 to the nth digit. If
the discarded number is exactly half a unit in the nth place, leave the nth
digit unaltered if it is an even number, but increase it by 1 if it is an 
odd number; in other words, round off so as to leave the nth digit an 
even number in such cases. 

When a number has been rounded off according to the rule just stated, 
it is said to be correct to n significant figures. 

The following numbers are rounded off correctly to four significant 
figures: 

29.63243  becomes 29.63
81.9773   becomes 81.98
4.4996001 becomes 4.500
11.64489  becomes 11.64
48.365    becomes 48.36
67.495    becomes 67.50

When the above rule is followed consistently, the errors due to rounding 
are largely cancelled by one another. 
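
The rule can be tried out with a short Python sketch, offered here as an illustration and not as part of the original text. The function name `round_sig` is hypothetical. The sketch assumes the number is given as a decimal string and relies on the `ROUND_HALF_EVEN` mode of Python's standard `decimal` module, which leaves the nth digit even when the discarded part is exactly half a unit.

```python
from decimal import Decimal, ROUND_HALF_EVEN


def round_sig(text: str, n: int) -> str:
    """Round a decimal string to n significant figures, ties going to the even digit."""
    d = Decimal(text)
    # Power of ten of the nth significant place, counted from the leading digit.
    place = d.adjusted() - (n - 1)
    rounded = d.quantize(Decimal(1).scaleb(place), rounding=ROUND_HALF_EVEN)
    return str(rounded)


for value in ("29.63243", "81.9773", "4.4996001", "11.64489", "48.365", "67.495"):
    print(value, "becomes", round_sig(value, 4))
```

Values such as 48.365 and 67.495 are the instructive cases: the discarded part is exactly half a unit, so the first keeps its even 6 and the second has its odd 9 increased. One limitation of this sketch is that a number such as 99.96, rounded to three figures, carries into a new leading digit and is returned with four digits.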

Such is not the case, however, if the computer follows an old rule which 
is sometimes advocated. The old rule says that when a 5 is dropped the
preceding digit should always be increased by 1. This is bad advice and
is conducive to an accumulation of rounding errors and therefore to
inaccuracy in computation. It should be obvious to any thinking person
that when a 5 is cut off, the preceding digit should be increased by 1 in
only half the cases and should be left unchanged in the other half. Since
even and odd digits occur with equal frequency, on the average, the rule
that the odd digits be increased by 1 when a 5 is dropped is logically sound.
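
The accumulation of error under the old rule can be seen in a small experiment. The Python sketch below is an illustration under stated assumptions and not part of the original text: the ten sample values 0.5, 1.5, ..., 9.5 are chosen for the purpose, the helper name `total_rounding_error` is hypothetical, and the old rule is represented by the `ROUND_HALF_UP` mode of the standard `decimal` module.

```python
from decimal import Decimal, ROUND_HALF_EVEN, ROUND_HALF_UP

# Ten numbers whose discarded part is exactly half a unit in the units place.
values = [Decimal(f"{k}.5") for k in range(10)]


def total_rounding_error(rounding: str) -> Decimal:
    """Sum of (rounded value - exact value) when rounding to whole numbers."""
    unit = Decimal(1)
    return sum(v.quantize(unit, rounding=rounding) - v for v in values)


old_rule = total_rounding_error(ROUND_HALF_UP)     # always increase the digit
even_rule = total_rounding_error(ROUND_HALF_EVEN)  # leave the digit even

print("old rule, accumulated error: ", old_rule)
print("even rule, accumulated error:", even_rule)
```

Under the old rule every one of the ten errors is half a unit in the same direction, so they add up; under the even rule five are positive and five negative, and they cancel.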

The case where the number to be discarded is exactly half a unit in the
nth place deserves further comment. From purely logical considerations
the digit preceding the discarded 5000... might just as well be left odd,
but there is a practical aspect to the matter. Rounded numbers must often
be divided by other numbers, and it is highly desirable from the standpoint
of accuracy that the division be exact as often as possible. An even
number is always divisible by 2, it may be divisible by other even numbers,
and it may also be divisible by several odd numbers; whereas an odd
number is not divisible by any even number and it may not be divisible by
any odd number. Hence, in general, even numbers are exactly divisible



ACCURACY OF APPROXIMATE CArX}ULATIONS 


[Chap. I 


by many more numbers than are odd numbers, and therefore there will 
be fewer left-over errors in a computation when the rounded numbers are 
left even. The rule that the last digit be left even rather than odd is 
thus conducive to accuracy in computation. 

In certain rare instances the rule for cutting off 5000··· should be
modified. For example, if a 5 is to be cut off from two or more numbers
in a column that is to be added, the preceding digit should be increased
by 1 in half the cases and left unchanged in the other half, regardless of
whether the preceding digit is even or odd. Other cases might arise
where common sense should be the guide in making the errors neutralize
one another.

4. Absolute, Relative, and Percentage Errors. The absolute error of 
a number, measurement, or calculation is the numerical difference between 
the true value of the quantity and its approximate value as given, or 
obtained by measurement or calculation. The relative error is the absolute 
error divided by the true value of the quantity. The percentage error is 
100 times the relative error. For example, let Q represent the true value 
of some quantity. If $\Delta Q$ is the absolute error of an approximate value
of Q, then

    $\Delta Q / Q$ = relative error of the approximate quantity.
    $100\,\Delta Q / Q$ = percentage error of the approximate quantity.

If a number is correct to n significant figures, it is evident that its
absolute error can not be greater than half a unit in the nth place. For
example, if the number 4.629 is correct to four figures, its absolute error
is not greater than $0.001 \times \tfrac12 = 0.0005$.

Remark. It is to be noted that relative and percentage errors are 
independent of the unit of measurement, whereas absolute errors are 
expressed in terms of the unit used. 
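
The three definitions, and the remark about units, can be checked with a few lines of code. This is a minimal sketch; the function name errors and the sample length (2 inches read as 2.0005 inches) are hypothetical choices made for illustration.

```python
def errors(true_value: float, approx_value: float):
    """Return (absolute, relative, percentage) error of approx_value."""
    absolute = abs(true_value - approx_value)
    relative = absolute / abs(true_value)
    return absolute, relative, 100.0 * relative

# A hypothetical length of 2 inches read as 2.0005 inches, and the same in centimetres.
inch = errors(2.0, 2.0005)
cm = errors(2.0 * 2.54, 2.0005 * 2.54)
print(inch)
print(cm)
```

The absolute error changes with the unit (about 0.0005 inch against about 0.00127 cm), while the relative and percentage errors come out the same in both units.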

5. Relation between Relative Error and the Number of Significant 
Figures. The belief is widespread, even in scientific circles, that the 
accuracy of a measurement or of a computed result is indicated by the 
number of decimals required to express it. This belief is erroneous, for
the accuracy of a result is indicated by the number of significant figures
required to express it. The true index of the accuracy of a measurement
or of a calculation is the relative error. For example, if the diameter of
a 2-inch steel shaft is measured to the nearest thousandth of an inch, the
result is less accurate than the measurement of a mile of railroad track
to the nearest foot. For although the absolute errors in the two measurements
are 0.0005 inch and 6 inches, respectively, the relative errors are
0.0005/2 = 1/4000 and 1/10,560. Hence in the measurement of the shaft
we make an error of one part in 4000, whereas in the case of the railroad
we make an error of one part in 10,560. The latter measurement is clearly
the more accurate, even though its absolute error is 12,000 times as great.
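
The arithmetic of the shaft and railroad comparison can be verified exactly with rational numbers. A minimal sketch, assuming Python's fractions module; the helper name relative_error is hypothetical.

```python
from fractions import Fraction

def relative_error(abs_error: Fraction, value: Fraction) -> Fraction:
    return abs_error / value

# Shaft: 2 inches, measured to the nearest 0.001 inch (half-unit error 0.0005 inch).
shaft = relative_error(Fraction(5, 10000), Fraction(2))
# Track: 1 mile = 5280 feet, measured to the nearest foot (half-unit error 1/2 foot).
track = relative_error(Fraction(1, 2), Fraction(5280))

print(shaft, track)
# Ratio of the absolute errors, both in inches (6 inches against 0.0005 inch)
print(Fraction(6) / Fraction(5, 10000))
```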

The relation between the relative error and the number of correct figures 
is given by the following fundamental theorem : 

Theorem I. If the first significant figure of a number is k, and the
number is correct to n significant figures, then the relative error is less
than $1/(k \times 10^{n-1})$.

Before giving a literal proof of this theorem we shall first show that it 
holds for several numbers picked at random. Henceforth we shall denote 
absolute and relative errors of numbers by the symbols Ea and Er, 
respectively. 

Example 1. Let us suppose that the number 864.32 is correct to five
significant figures. Then $k = 8$, $n = 5$, and
$E_a \le 0.01 \times \tfrac12 = 0.005$. For the relative error we have

$$E_r \le \frac{0.005}{864.32 - 0.005} = \frac{5}{864320 - 5} = \frac{1}{2 \times 86432 - 1}
      = \frac{1}{2(86432 - \tfrac12)} < \frac{1}{2 \times 8 \times 10^4} < \frac{1}{8 \times 10^4}.$$

Hence the theorem holds here.

Example 2. Next, let us consider the number 369,230. Assuming that
the last digit (the zero) is written merely to fill the place of a discarded
digit and is therefore not a significant figure, we have $k = 3$, $n = 5$, and
$E_a \le 10 \times \tfrac12 = 5$. Then

$$E_r \le \frac{5}{369230 - 5} = \frac{1}{2 \times 36923 - 1} = \frac{1}{2(36923 - \tfrac12)}
      < \frac{1}{2 \times 3 \times 10^4} < \frac{1}{3 \times 10^4}.$$

Example 3. Finally, suppose the number 0.0800 is correct to three
significant figures. Then $k = 8$, $n = 3$, $E_a \le 0.0001 \times \tfrac12 = 0.00005$, and

$$E_r \le \frac{0.00005}{0.0800 - 0.00005} = \frac{5}{8000 - 5} = \frac{1}{1600 - 1}
      = \frac{1}{2(800 - \tfrac12)} < \frac{1}{8 \times 10^2}.$$

It is to be noted that in this example the relative error is not certainly
less than $1/(2k \times 10^{n-1})$, as was the case in Examples 1 and 2 above.
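
The three examples can be checked with exact rational arithmetic. The sketch below is an illustration only: the helper names worst_relative_error and theorem_bound are hypothetical, and the worst case is taken, as in the examples, to be a true value half a unit below the given number.

```python
from fractions import Fraction

def worst_relative_error(value: str, half_unit: str) -> Fraction:
    """Largest relative error when the true value may be value - half_unit."""
    e = Fraction(half_unit)
    return e / (Fraction(value) - e)

def theorem_bound(k: int, n: int) -> Fraction:
    """The bound 1/(k * 10**(n-1)) of Theorem I."""
    return Fraction(1, k * 10 ** (n - 1))

cases = [("864.32", "0.005", 8, 5), ("369230", "5", 3, 5), ("0.0800", "0.00005", 8, 3)]
results = []
for value, half_unit, k, n in cases:
    er = worst_relative_error(value, half_unit)
    # second flag tests the sharper bound 1/(2k * 10**(n-1))
    results.append((er, er < theorem_bound(k, n), er < theorem_bound(2 * k, n)))
    print(value, results[-1])
```

All three numbers satisfy the bound of Theorem I; only the third, 0.0800, fails the sharper bound with 2k in the denominator, which is the exceptional form with a single non-zero digit.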

To prove the theorem generally, let

    N = any number (exact value),
    n = number of correct significant figures,
    m = number of correct decimal places.

Three cases must be distinguished, namely $m < n$, $m = n$, and $m > n$.

Case 1. $m < n$. Here the number of digits in the integral part of N
is $n - m$. Denoting the first significant figure of N by k, as before, we
have

$$E_a \le \frac{1}{10^m} \times \frac12, \qquad N \ge k \times 10^{n-m-1} - \frac{1}{10^m} \times \frac12 .$$

Hence

$$E_r \le \frac{\frac{1}{10^m} \times \frac12}{k \times 10^{n-m-1} - \frac{1}{10^m} \times \frac12}
   = \frac{10^{-m}}{2k \times 10^{n-1} \times 10^{-m} - 10^{-m}}
   = \frac{1}{2k \times 10^{n-1} - 1} = \frac{1}{2(k \times 10^{n-1} - \tfrac12)} .$$

Remembering now that n is a positive integer and that k stands for any
one of the digits from 1 to 9 inclusive, we readily see that
$2k \times 10^{n-1} - 1 > k \times 10^{n-1}$ in all cases except $k = 1$ and $n = 1$. But this
is the trivial case where N = 1, 0.01, etc.; that is, where N contains only
one digit different from zero and this digit is 1, a case which would never
occur in practice. Hence for all other cases we have
$2k \times 10^{n-1} - 1 > k \times 10^{n-1}$, and therefore

$$E_r < \frac{1}{k \times 10^{n-1}} .$$

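Case 2. $m = n$. Here N is a decimal and k is the first decimal figure.
We then have

$$E_a \le \frac{1}{10^m} \times \frac12, \qquad N \ge k \times 10^{-1} - \frac{1}{10^m} \times \frac12 .$$

$$\therefore\ E_r \le \frac{10^{-m} \times \frac12}{k \times 10^{-1} - 10^{-m} \times \frac12}
   = \frac{10^{-m}}{2k \times 10^{-1} - 10^{-m}}
   = \frac{1}{2k \times 10^{m-1} - 1} = \frac{1}{2k \times 10^{n-1} - 1} < \frac{1}{k \times 10^{n-1}} .$$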

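Case 3. $m > n$. In this case k occupies the $(m - n + 1)$th decimal
place and therefore

$$N \ge k \times 10^{-(m-n+1)} - \frac{1}{10^m} \times \frac12, \qquad E_a \le \frac{1}{10^m} \times \frac12 .$$

$$\therefore\ E_r \le \frac{10^{-m} \times \frac12}{k \times 10^{-m} \times 10^{n-1} - 10^{-m} \times \frac12}
   = \frac{10^{-m}}{2k \times 10^{-m} \times 10^{n-1} - 10^{-m}}
   = \frac{1}{2k \times 10^{n-1} - 1} < \frac{1}{k \times 10^{n-1}} .$$

The theorem is therefore true in all cases.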

Corollary 1. Except in the case of approximate numbers of the form
$k(1.000\cdots) \times 10^p$, in which k is the only digit different from zero, the
relative error is less than $1/(2k \times 10^{n-1})$.

Corollary 2. If $k \ge 5$ and the given approximate number is not of the
form $k(1.000\cdots) \times 10^p$, then $E_r < 1/10^n$; for in this case $2k \ge 10$ and
therefore $2k \times 10^{n-1} \ge 10^n$.

To find the number of correct figures corresponding to a given relative 
error we can not take the converse of the theorem stated at the beginning 
of this article, for the converse theorem is not true. In proving the 
formula for the relative error we took the lower limit for N in order to
obtain the upper limit for $E_r$. Thus, for the lower limit of N we took
its first significant figure multiplied by a power of 10. In the converse
problem of finding the number of correct figures corresponding to a given
relative error we must find the upper limit of the absolute error $E_a$,
and since $E_a = N E_r$, we should use the upper limit for N. This upper
limit will be $k + 1$ times a power of 10, where k is the first significant
figure in N. For example, if the approximate value of N is 6895, the
lower limit to be used in finding the relative error is $6 \times 10^3$, whereas the
upper limit to be used in finding the absolute error is $7 \times 10^3$.

To solve the converse problem we utilize Theorem II: 

Theorem II. If the relative error in an approximate number is less
than $1/[(k + 1) \times 10^{n-1}]$, the number is correct to n significant figures,
or at least is in error by less than a unit in the nth significant figure.

To prove this theorem let

    N = the given number (exact value),
    n = number of correct significant figures in N,
    k = first significant figure in N,
    p = number of digits in the integral part of N.

Then

    $n - p$ = number of decimals in N,

and $N \le (k + 1) \times 10^{p-1}$.

Let

$$E_r < \frac{1}{(k + 1) \times 10^{n-1}} .$$

Then

$$E_a < (k + 1) \times 10^{p-1} \times \frac{1}{(k + 1) \times 10^{n-1}} = \frac{1}{10^{n-p}} .$$

Now $1/10^{n-p}$ is one unit in the $(n - p)$th decimal place, or in the nth
significant figure. Hence the absolute error is less than a unit in the
nth significant figure.

If the given number is a pure decimal, let

p = number of zeros between the decimal point and first significant
figure. Then $n + p$ = number of decimals in N, and

$$N < (k + 1) \times \frac{1}{10^{p+1}} .$$

Hence if

$$E_r < \frac{1}{(k + 1) \times 10^{n-1}}$$

we have

$$E_a < \frac{k + 1}{10^{p+1}} \times \frac{1}{(k + 1) \times 10^{n-1}} = \frac{1}{10^{n+p}} .$$

But $1/10^{n+p}$ is one unit in the $(n + p)$th decimal place, or in the nth
significant figure. Hence the absolute error $E_a$ is less than a unit in the
nth significant figure.

Corollary 3. If $E_r < 1/[2(k + 1) \times 10^{n-1}]$, then $E_a$ is less than half
a unit in the nth significant figure and the given number is correct to n
significant figures in all cases.

Corollary 4. Since k may have any value from 1 to 9 inclusive, it is
evident that $k + 1$ may have any value from 2 to 10. Hence the upper
and lower limits of the fraction $1/[2(k + 1) \times 10^{n-1}]$ are $1/(4 \times 10^{n-1})$
and $1/(2 \times 10^n)$, respectively. We can therefore assert that

If the relative error of any number is not greater than $1/(2 \times 10^n)$, the
number is certainly correct to n significant figures.

Remark. The reader can readily see from the preceding discussion that 
the absolute error is connected with the number of decimal places, whereas 
the relative error is connected with the number of significant figures. 
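
Theorem II and Corollary 4 lend themselves to a small lookup routine. The following is a sketch only; the function names are hypothetical, and the relative error is passed as an exact Fraction so that the comparisons at the limits are not blurred by floating-point rounding.

```python
from fractions import Fraction

def figures_within_one_unit(rel_error: Fraction, k: int) -> int:
    """Largest n with rel_error < 1/((k+1)*10**(n-1)) (Theorem II); 0 if none."""
    if rel_error <= 0:
        raise ValueError("relative error must be positive")
    n = 0
    while rel_error < Fraction(1, (k + 1) * 10 ** n):
        n += 1
    return n

def figures_certainly_correct(rel_error: Fraction) -> int:
    """Largest n with rel_error <= 1/(2*10**n) (Corollary 4); 0 if none."""
    if rel_error <= 0:
        raise ValueError("relative error must be positive")
    n = 0
    while rel_error <= Fraction(1, 2 * 10 ** (n + 1)):
        n += 1
    return n

# Hypothetical case: relative error 0.0002 in a number whose first figure is 3
print(figures_within_one_unit(Fraction(1, 5000), 3))
print(figures_certainly_correct(Fraction(1, 5000)))
```

For this hypothetical case the number is certainly correct to three figures, and its fourth figure is in error by less than one unit.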


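6. The General Formula for Errors. Let

(1)    $N = f(u_1, u_2, u_3, \dots, u_n)$

denote any function of several independent quantities $u_1, u_2, \dots, u_n$, which
are subject to the errors $\Delta u_1, \Delta u_2, \dots, \Delta u_n$, respectively. These errors in
the u's will cause an error $\Delta N$ in the function N, according to the relation

(2)    $N + \Delta N = f(u_1 + \Delta u_1, u_2 + \Delta u_2, \dots, u_n + \Delta u_n)$.

To find an expression for $\Delta N$ we must expand the right-hand member
of (2) by Taylor's theorem for a function of several variables. Hence
we have

$$f(u_1 + \Delta u_1, u_2 + \Delta u_2, \dots, u_n + \Delta u_n) = f(u_1, u_2, \dots, u_n)
   + \Delta u_1 \frac{\partial f}{\partial u_1} + \Delta u_2 \frac{\partial f}{\partial u_2} + \cdots + \Delta u_n \frac{\partial f}{\partial u_n}$$
$$\qquad + \frac12 \left[ (\Delta u_1)^2 \frac{\partial^2 f}{\partial u_1^2} + \cdots + (\Delta u_n)^2 \frac{\partial^2 f}{\partial u_n^2}
   + 2\,\Delta u_1 \Delta u_2 \frac{\partial^2 f}{\partial u_1\,\partial u_2} + \cdots \right] + \cdots .$$

Now since the errors $\Delta u_1, \Delta u_2, \dots, \Delta u_n$ are always relatively small,* we
may neglect their squares, products, and higher powers and write

(3)    $N + \Delta N = f(u_1, u_2, u_3, \dots, u_n) + \Delta u_1 \dfrac{\partial f}{\partial u_1} + \Delta u_2 \dfrac{\partial f}{\partial u_2} + \cdots + \Delta u_n \dfrac{\partial f}{\partial u_n}$.

Subtracting (1) from (3), we get

       $\Delta N = \dfrac{\partial f}{\partial u_1} \Delta u_1 + \dfrac{\partial f}{\partial u_2} \Delta u_2 + \cdots + \dfrac{\partial f}{\partial u_n} \Delta u_n$,

or

(6.1)    $\Delta N = \dfrac{\partial N}{\partial u_1} \Delta u_1 + \dfrac{\partial N}{\partial u_2} \Delta u_2 + \cdots + \dfrac{\partial N}{\partial u_n} \Delta u_n$.

This is the general formula for computing the error of a function, and
it includes all possible cases. It will be observed that the right-hand
member of (6.1) is merely the total differential of the function N.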
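
Formula (6.1) can be tried numerically. The sketch below is an illustration under stated assumptions: the partial derivatives are approximated by central differences rather than found analytically, and the example function (the volume of a cylinder) and its data are hypothetical.

```python
import math

def linear_error(f, u, du, h=1e-6):
    """Estimate the error of f(u) as the sum of (df/du_i) * du_i, using central differences."""
    total = 0.0
    for i, d in enumerate(du):
        up = list(u)
        up[i] += h
        down = list(u)
        down[i] -= h
        partial = (f(*up) - f(*down)) / (2 * h)
        total += partial * d
    return total

def volume(r, height):          # hypothetical example function
    return math.pi * r * r * height

u, du = [2.0, 5.0], [0.01, 0.02]
estimate = linear_error(volume, u, du)
actual = volume(2.01, 5.02) - volume(2.0, 5.0)
print(estimate, actual)
```

The linear estimate and the actual change agree to within the neglected squares and products of the errors, which is what the derivation above leads one to expect.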

For the relative error of the function N we have

(6.2)    $E_r = \dfrac{\Delta N}{N} = \dfrac{\partial N}{\partial u_1} \dfrac{\Delta u_1}{N} + \dfrac{\partial N}{\partial u_2} \dfrac{\Delta u_2}{N} + \cdots + \dfrac{\partial N}{\partial u_n} \dfrac{\Delta u_n}{N}$.

When N is a function of the form

(6.3)    $N = \dfrac{K a^m b^n c^p}{d^q e^r}$,

then by (6.2) the relative error is

    $E_r = \Delta N / N = m(\Delta a/a) + n(\Delta b/b) + p(\Delta c/c) - q(\Delta d/d) - r(\Delta e/e)$.

But since the errors $\Delta a, \dots, \Delta e$, etc. are just as likely to be negative as
positive, we must take all the terms with the positive sign in order to be
sure of the maximum error in the function N. Hence we write

(6.4)    $E_r \le m\,|\Delta a/a| + n\,|\Delta b/b| + p\,|\Delta c/c| + q\,|\Delta d/d| + r\,|\Delta e/e|$.

* A quantity P is said to be relatively small in comparison with a second quantity
Q when the ratio P/Q is small in comparison with unity. The squares and products
of such small ratios are negligible in most calculations.
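
Formula (6.4) amounts to a weighted sum of relative errors, the weights being the exponents. A minimal sketch, with a hypothetical function name and made-up data (N = a²b/c with relative errors 0.001, 0.002 and 0.0005):

```python
def max_relative_error(terms):
    """terms: (exponent, relative error) pairs, one for each factor of N as in (6.3)."""
    return sum(abs(exponent) * abs(rel) for exponent, rel in terms)

# Hypothetical case N = a^2 * b / c with relative errors 0.001, 0.002, 0.0005.
bound = max_relative_error([(2, 0.001), (1, 0.002), (-1, 0.0005)])

# Worst case by direct evaluation: a and b too large, c too small.
actual = (1.001 ** 2) * 1.002 / 0.9995 - 1.0
print(bound, actual)
```

The directly evaluated worst case exceeds the value from (6.4) only in the sixth decimal place, an effect of the squares and products of the errors that the formula neglects.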

7. Application of the Error Formulas to the Fundamental Operations 
of Arithmetic and to Logarithms. We shall now apply the preceding 
results to the fundamental operations of arithmetic. 

7a). Addition. Let

    $N = u_1 + u_2 + \cdots + u_n$.

Then

(7.1)    $\Delta N = E_a = \Delta u_1 + \Delta u_2 + \cdots + \Delta u_n$.

The absolute error of a sum of approximate numbers is therefore equal to
the algebraic sum of their absolute errors.

The proper way to add approximate numbers of different accuracies is 
shown in the two examples below. 

Example 1. Find the sum of the approximate numbers 561.32, 491.6,
86.954, and 3.9462, each being correct to its last figure but no farther.

Solution. Since the second number is known only to the first decimal
place, it would be useless and absurd to retain more than two decimals
in any of the other numbers. Hence we round them off to two decimals,
add the four numbers, and give the result to one decimal place, as shown
below:

     491.6
     561.32
      86.95
       3.95
    -------
    1143.8

By retaining two decimals in the more accurate numbers we eliminate
the errors inherent in these numbers and thus reduce the error of the sum
to that of the least accurate number. The final result, however, is uncertain
by one unit in its last figure.

Example 2. Find the sum of 36490, 994, 557.32, 29500, and 86939,
assuming that the number 29500 is known to only three significant figures.

Solution. Since one of the numbers is known only to the nearest hundred,
we round off the others to the nearest ten, add, and give the sum to hundreds,
as shown below:

     29500
     86940
     36490
       990
       560
    ------
    154500  or  $1.545 \times 10^5$.

The result is uncertain by one unit in the last significant figure.

In general, if we find the sum of m numbers each of which has been 
rounded off correctly to the same place, the error in the sum may be as 
great as m/2 units in the last significant figure. 
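
The procedure of the two examples (round the more accurate numbers to one place beyond the least accurate, add, then round the sum) can be sketched as follows. The function name add_approximate is hypothetical, and so is the convention that each number is written as a string whose last digit is its last reliable one, so that 29500 known to three figures is entered as "295E2".

```python
from decimal import Decimal, ROUND_HALF_EVEN

def add_approximate(numbers):
    """Add approximate numbers given as strings whose last digit is the last reliable one."""
    values = [Decimal(s) for s in numbers]
    # Place of the last digit of the least accurate number (-1 = tenths, 2 = hundreds, ...)
    coarsest = max(v.as_tuple().exponent for v in values)
    guard = Decimal(1).scaleb(coarsest - 1)    # keep one extra place while adding
    final = Decimal(1).scaleb(coarsest)
    total = sum((v.quantize(guard, rounding=ROUND_HALF_EVEN) for v in values), Decimal(0))
    return total.quantize(final, rounding=ROUND_HALF_EVEN)

print(add_approximate(["561.32", "491.6", "86.954", "3.9462"]))
print(add_approximate(["36490", "994", "557.32", "295E2", "86939"]))
```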

7b). Averages. An important case in the addition of numbers must 
here be considered. Suppose we are to find the mean of several approxi- 
mate numbers. Is this mean reliable to any more figures than are the 
numbers from which it was obtained? The answer is yes, but in order to 
see why let us consider the following concrete case. 

The first column below contains the mantissas of ten consecutive 
logarithms taken from a six-place table. The second column contains these 
same mantissas rounded off to five decimals. The third column gives the 
errors due to rounding, expressed in units of the sixth decimal place. 


        N              N'           E

    0.961421        0.96142         1
    0.961469        0.96147        -1
    0.961516        0.96152        -4
    0.961563        0.96156         3
    0.961611        0.96161         1
    0.961658        0.96166        -2
    0.961706        0.96171        -4
    0.961753        0.96175         3
    0.961801        0.96180         1
    0.961848        0.96185        -2

    Average,        Av.,           Sum, -4
    0.9616346       0.961635       Av., -0.4
    = 0.961635

Here we have the relation

    $N = N' + E$

for each of the numbers and therefore the further relations

    $\sum N = \sum N' + \sum E$

and

    $\sum N / n = \sum N' / n + \sum E / n$.


It will be noticed that the average of the rounded numbers is in error 
by only 0.4 of a unit in the sixth decimal place. We may therefore call it 
correct to six decimals, or to one more place than the rounded numbers. 

The entries in all numerical tables and the results of all measurements 
are rounded numbers in which the error is not greater than half a unit 
in the last significant figure. These errors (due to rounding) are in
general as likely to be positive as negative and hence their algebraic sum 
is never large. Usually it is less than a unit in the last figure. 

The foregoing considerations justify the computer in retaining one more 
figure in the mean of a set of numbers than are given in the numbers 
themselves. But rarely should he retain the mean to more than one 
additional figure. 
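
The table of mantissas can be recomputed directly. A minimal sketch, assuming Python's decimal module and taking the ten six-place mantissas as 0.961421, 0.961469, ..., 0.961848:

```python
from decimal import Decimal, ROUND_HALF_EVEN

mantissas = ["0.961421", "0.961469", "0.961516", "0.961563", "0.961611",
             "0.961658", "0.961706", "0.961753", "0.961801", "0.961848"]
exact = [Decimal(s) for s in mantissas]
rounded = [d.quantize(Decimal("0.00001"), rounding=ROUND_HALF_EVEN) for d in exact]

mean_exact = sum(exact, Decimal(0)) / len(exact)
mean_rounded = sum(rounded, Decimal(0)) / len(rounded)
# Rounding errors N - N' in units of the sixth decimal place
errors = [int((a - b) * 10**6) for a, b in zip(exact, rounded)]
print(mean_exact, mean_rounded, sum(errors))
```

The individual rounding errors run as high as four units of the sixth place, yet their sum is only -4, so the two means differ by 0.4 of a unit.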

7c). Subtraction. Here

    $N = u_1 - u_2$

and

(7.2)    $\Delta N = E_a = \Delta u_1 - \Delta u_2$.

Since the errors $\Delta u_1$ and $\Delta u_2$ may be either positive or negative, however,
we must take the sum of the absolute values of the errors in order to get
the maximum error. We then have the result that the absolute error of
the difference of two approximate numbers may equal the sum of their
absolute errors.

When one approximate number is to be subtracted from another, they 
must both be rounded off to the same place before subtracting. Thus, to 
subtract 46.365 from 779.8, assuming that each number is approximate and 
correct only to its last figure, we have 

    779.8 - 46.4 = 733.4.

It would be absurd to write 779.800 - 46.365 = 733.435, because the last
two figures in the larger number as here written are not zeros.

7d). Loss of Significant Figures by Subtraction. 

The most serious error connected with the subtraction of approximate 
numbers arises from the subtraction of numbers which are nearly equal. 
Suppose, for example, that the numbers 64.395 and 63.994 are each correct
to five figures, but no more. Their difference, 64.395 - 63.994 = 0.401,
is correct to only three figures. Again, if the numbers 16950 and 16870
are each correct to only four significant figures, their difference
16950 - 16870 = 80 is correct to only one significant figure, and even this
figure may be in error by one unit.

Errors arising from the disappearance of the most important figures on 
the left, as in the two examples of the preceding paragraph, are of frequent 
occurrence and sometimes render the result of a computation worthless. 
They must be carefully guarded against and eliminated wherever possible. 

The inaccuracy resulting from the loss of the most important significant 
figures in the subtraction of two nearly equal numbers can be lessened, and 
sometimes entirely avoided, in one of two ways : 

1. By approximating each of the numbers with sufficient accuracy
before subtraction, when this is possible. Thus, to find the difference
$\sqrt{2.03} - \sqrt{2}$ correct to five significant figures, we take $\sqrt{2.03} = 1.424781$
and $\sqrt{2} = 1.414214$. Then 1.424781 - 1.414214 = 0.010567. Note that
a slide-rule computation is worthless in such a case as this.

This method is limited when the two given numbers are approximate and 
true to only a few digits. 
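
The effect of carrying too few figures into such a subtraction can be imitated with limited-precision decimal arithmetic. This is a sketch under stated assumptions: the function name sqrt_difference is hypothetical, and a decimal Context with four digits stands in, roughly, for a computation carried to four figures.

```python
from decimal import Decimal, Context

def sqrt_difference(a: str, b: str, digits: int) -> Decimal:
    """sqrt(a) - sqrt(b) with every intermediate value held to `digits` figures."""
    ctx = Context(prec=digits)
    return ctx.subtract(Decimal(a).sqrt(ctx), Decimal(b).sqrt(ctx))

print(sqrt_difference("2.03", "2", 4))   # roots carried to four figures
print(sqrt_difference("2.03", "2", 7))   # roots carried to seven figures
```

With four-figure roots only two figures of the difference survive; with seven-figure roots the difference is obtained to five figures.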

2. By transforming the expression whose value is desired. Thus, to find
the value of $1 - \cos x$ when x is small and no extended table is at hand,
write $1 - \cos x = 2\sin^2(x/2)$ in some cases, and in other cases replace
$\cos x$ by its Taylor expansion. Then

$$1 - \cos x = 1 - \left(1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \cdots\right) = \frac{x^2}{2!} - \frac{x^4}{4!} + \cdots .$$


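In finding the area of a circular segment having a small central angle,
replace $\sin\theta$ by its Taylor expansion. Thus

$$K = \frac{R^2}{2}(\theta - \sin\theta)
    = \frac{R^2}{2}\left[\theta - \left(\theta - \frac{\theta^3}{3!} + \frac{\theta^5}{5!} - \cdots\right)\right]
    = \frac{R^2}{2}\left(\frac{\theta^3}{6} - \frac{\theta^5}{120} + \cdots\right);$$

otherwise the area of a plainly visible segment might turn out to be zero
when 4- or 5-place tables are used.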
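
The same difficulty arises in machine arithmetic, where the working precision plays the part of the table. The sketch below compares the direct form of 1 - cos x with the transformed form 2 sin²(x/2) in double precision; the function names and the value x = 1e-9 are hypothetical choices.

```python
import math

def one_minus_cos_naive(x: float) -> float:
    return 1.0 - math.cos(x)

def one_minus_cos_stable(x: float) -> float:
    # 1 - cos x = 2 sin^2(x/2): no subtraction of nearly equal numbers
    return 2.0 * math.sin(x / 2.0) ** 2

x = 1e-9
print(one_minus_cos_naive(x), one_minus_cos_stable(x))
```

For so small an angle the direct form typically returns zero, since cos x is indistinguishable from 1 in double precision, whereas the transformed form gives a value close to x²/2.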

Sometimes in the evaluation of such an expression as $\sqrt{a} - \sqrt{b}$, where
b is only slightly less than a, one or more significant figures can be saved
by rationalizing the expression as the first step in the calculation. Thus,

$$\sqrt{a} - \sqrt{b} = \frac{a - b}{\sqrt{a} + \sqrt{b}} .$$

This method is of value only when fewer digits are lost by taking $a - b$
than by taking $\sqrt{a} - \sqrt{b}$.
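
A short numerical comparison shows what rationalizing can save. This is only an illustration: the function names and the values a = 10^15 + 1, b = 10^15 are hypothetical, chosen so that a - b is exact in double precision while the two square roots agree in nearly all their figures.

```python
import math

def root_difference_naive(a: float, b: float) -> float:
    return math.sqrt(a) - math.sqrt(b)

def root_difference_rationalized(a: float, b: float) -> float:
    # sqrt(a) - sqrt(b) = (a - b) / (sqrt(a) + sqrt(b))
    return (a - b) / (math.sqrt(a) + math.sqrt(b))

a, b = 1e15 + 1.0, 1e15      # hypothetical values, b only slightly less than a
print(root_difference_naive(a, b), root_difference_rationalized(a, b))
```

Here the direct difference is good to barely one figure, while the rationalized form keeps essentially full working precision, because a - b loses nothing in this instance.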

The general solution of a certain type of ladder problem in elementary
mechanics is

    [formula illegible in the source]

Here the terms in the numerator may be nearly equal for particular values
of the given quantities, and the terms of the denominator may also be
nearly equal for certain values. In such cases a slide-rule computation
may be worthless.

In making a transformation to prevent loss of significant figures by sub- 
traction, each problem must be treated individually. There is no known 
method or procedure that will fit all cases. 

The loss of the leading significant figures in the subtraction of two nearly 
equal numbers is the greatest source of inaccuracy in most computations, 
and it forms the weakest links in a chain computation where it occurs. The 
computer must be on his guard against it at all times. 

In general, if we desire the difference of two approximate numbers to
n significant figures, and if it is known beforehand that the first m figures
at the left will disappear by subtraction, we must start with $m + n$ significant
figures in each of the given numbers.

7e). Multiplication. In this case

    $N = u_1 u_2 u_3 \cdots u_n$.

Since this is of the form (6.3), in which $m = n = \cdots = r = 1$, we have
by (6.4)

(7.3)    $E_r = \Delta N / N = \Delta u_1/u_1 + \Delta u_2/u_2 + \cdots + \Delta u_n/u_n$.

The relative error of a product of n approximate numbers is therefore
equal to the algebraic sum of the relative errors of the separate numbers.

The accuracy of a product should always be investigated by means of the
relative error. The absolute error, if desired, can be found from the relation
$E_a = E_r N$.

When it is desired to find the product of two or more approximate 
numbers of different accuracies, the more accurate numbers should be 
rounded off so as to contain one more significant figure than the least 
accurate factor, for by so doing we eliminate the error due to the more 
accurate factors and thus make the error of the product due solely to the
errors of the less accurate numbers. The final result should be given to
as many significant figures as are contained in the least accurate factor, 
and no more. The proper method of procedure in such cases will be illus- 
trated by examples later on. 

7f). Division. Here we have

    $N = u_1/u_2$.

This is also of the form (6.3), where the exponents are all unity. Hence
by (6.4)

(7.4)    $E_r = \Delta u_1/u_1 + \Delta u_2/u_2$.

The relative error of a quotient is therefore equal to the algebraic sum of
the relative errors of divisor and dividend, but in order to get the maximum
error one should take the arithmetical sum of the errors.

A simple formula for the absolute error of a quotient can be found
directly, as follows:

Let $\Delta Q$ = absolute error of the quotient $u_1/u_2$. Then

$$\Delta Q = \frac{u_1 + \Delta u_1}{u_2 + \Delta u_2} - \frac{u_1}{u_2}
   = \frac{u_2\,\Delta u_1 - u_1\,\Delta u_2}{u_2(u_2 + \Delta u_2)}
   = \frac{u_1\left(\dfrac{\Delta u_1}{u_1} - \dfrac{\Delta u_2}{u_2}\right)}{u_2 + \Delta u_2} .$$

Now let $\omega$ denote the greatest absolute value of either $\Delta u_1/u_1$ or $\Delta u_2/u_2$,
and take the signs of $\Delta u_1$ and $\Delta u_2$ so as to get the greatest value of $\Delta Q$.
Then since $\Delta u_2/u_2 \le \omega$, we have $\Delta u_2 \le \omega u_2$, and therefore if $u_1$ and $u_2$ are
both subject to errors of the same order of magnitude we have

$$\Delta Q \le \frac{u_1(\omega + \omega)}{u_2 - \omega u_2} = \frac{2 u_1 \omega}{u_2(1 - \omega)} .$$

If only $u_1$ or $u_2$ is subject to error and the other is free from error in
comparison with it, then

$$\Delta Q \le \frac{u_1 \omega}{u_2 - \omega u_2} = \frac{u_1 \omega}{u_2(1 - \omega)} .$$

Finally, if $\omega$ is negligible in comparison with 1, we get

(7.5)    $\Delta Q \le 2(u_1/u_2)\,\omega$

if $u_1$ and $u_2$ are both subject to errors of the same order of magnitude; and

(7.6)    $\Delta Q \le (u_1/u_2)\,\omega$

if only $u_1$ or $u_2$ is subject to error.

As in the case of products, the accuracy of a quotient should always be
investigated by means of the relative error, and all the statements made
above in regard to products hold for quotients. In particular, if one of 
the numbers (divisor or dividend) is more accurate than the other, the 
more accurate number should be rounded off so as to contain one more 
significant figure than the less accurate one. The result should be given 
to as many significant figures as the less accurate number, and no more. 
The following examples will illustrate the proper methods of investigating 
the accuracy of products and quotients. 

Example 1. Find the product of 349.1 X 863.4 and state how many
figures of the result are trustworthy.

Solution. Assuming that each number is correct to four figures but
no more, we have $\Delta u_1 \le 0.05$, $\Delta u_2 \le 0.05$. Hence

$$E_r \le \frac{0.05}{349.1} + \frac{0.05}{863.4} = 0.000143 + 0.000057 = 0.00020 .$$

The product of the given numbers is 301413 to six figures. The absolute
error of this product is

    $E_a = 301413 \times 0.00020 = 60$, possibly.

The true result therefore lies between 301473 and 301353, and the best
we can do is to take the mean of these numbers to four significant
figures, or

    $349.1 \times 863.4 = 301400 = 3.014 \times 10^5$.

Even then there is some uncertainty about the last figure.

Theorem II of Art. 5 also tells us that the above result is uncertain in
the fourth figure, but that the error in that figure is less than a unit.
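
The limits quoted for this product can be confirmed by multiplying the extreme values of the two factors. A minimal sketch using exact decimal arithmetic; the function name product_range is hypothetical.

```python
from decimal import Decimal

def product_range(a: str, b: str, half_unit: str):
    """Smallest and largest products when each factor may be off by half_unit."""
    x, y, e = Decimal(a), Decimal(b), Decimal(half_unit)
    return (x - e) * (y - e), (x + e) * (y + e)

low, high = product_range("349.1", "863.4", "0.05")
rel_bound = Decimal("0.05") / Decimal("349.1") + Decimal("0.05") / Decimal("863.4")
print(low, high, rel_bound)
```

The extreme products, 301352.3175 and 301473.5675, agree with the limits 301353 and 301473 obtained from the relative error.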

Example 2. Find the number of correct figures in the quotient
$56.3/\sqrt{5}$, assuming that the numerator is correct to its last figure but
no farther.

Solution. Here we take $\sqrt{5} = 2.236$ so as to make the divisor free
from error in comparison with the dividend. Then

    $E_r \le 0.05/56.3 < 0.0009$;

and since 56.3/2.236 = 25.2 we have

    $E_a < 25.2 \times 0.0009 < 0.023$.

Since this error does not affect the third figure of the quotient, we take
25.2 as the correct result.

Note that formula (7.6) also gives this result.

We could have seen at a glance, without any investigation, that the error
of the quotient in this example would be less than 0.025; for the denominator
is free from error and the possible error of 0.05 in the numerator is
to be divided by 2.236, thereby making the error of the quotient less than
half that amount.


Example 3. Find how many figures of the quotient $4.89\pi/6.7$ are trustworthy,
assuming that the denominator is true to only two figures.

Solution. The only appreciable error to be considered here is the possible
0.05 in the denominator. The corresponding relative error is

    $E_r \le 0.05/6.7 < 0.0075$.

The quotient to three figures is

    $\dfrac{4.89 \times 3.14}{6.7} = 2.29$.

Hence the possible absolute error is $E_a \le 2.29 \times 0.0075 < 0.02$. Since the
third figure of the quotient may be in error by nearly two units, we are
not justified in calling the result anything but 2.3, or

    $\dfrac{4.89\pi}{6.7} = 2.3$.

Formula (7.6) also gives this same result.

Example 4. Find the number of trustworthy figures in the quotient of
876.3/494.2, assuming that both numbers are approximate and true only
to the number of digits given.

Solution. Here the largest relative error is

    $0.05/494.2 = 0.000101$,

and the quotient is

    $876.3/494.2 = 1.7732$.

Hence by (7.5)

    $\Delta Q = 2(1.7732)(0.000101) = 0.000358$.

Since this error affects the fourth decimal place but not the third, we take
the quotient to be 1.773.

Note. The greatest and least values of the above quotient are

    $876.35/494.15 = 1.7734$  and  $876.25/494.25 = 1.7729$.

These values agree to four significant figures and both give 1.773.
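
Formulas (7.5) and (7.6), and the check by greatest and least values, can be sketched in a few lines. The function name quotient_error_bound and its both_uncertain flag are hypothetical; ordinary floating-point arithmetic is assumed to be accurate enough for a four-figure example.

```python
def quotient_error_bound(u1, u2, omega, both_uncertain=True):
    """Formula (7.5) when both numbers carry error, (7.6) when only one does."""
    factor = 2.0 if both_uncertain else 1.0
    return factor * abs(u1 / u2) * omega

u1, u2 = 876.3, 494.2
omega = max(0.05 / u1, 0.05 / u2)       # largest relative error of the two numbers
bound = quotient_error_bound(u1, u2, omega)
greatest = (u1 + 0.05) / (u2 - 0.05)
least = (u1 - 0.05) / (u2 + 0.05)
print(bound, greatest, least)
```

The bound from (7.5) is a little larger than the actual spread between the quotient and its extreme values, as one would expect when the larger of the two relative errors is counted twice.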
7g). Powers and Roots. Here N has the form

    $N = u^m$.

Hence by (6.4)

    $E_r \le m(\Delta u/u)$.

For the pth power of a number we put $m = p$ and have

    $E_r \le p(\Delta u/u)$.

The relative error of the pth power of a number is thus p times the
relative error of the given number.

For the rth root of a number we put $m = 1/r$ and get

    $E_r \le \dfrac{1}{r}(\Delta u/u)$.

Hence the relative error of the rth root of an approximate number is only
1/rth of the relative error of the given number.

Example. Find the number of trustworthy figures in $(0.3862)^4$, assuming
that the number in parentheses is correct to its last figure but
no farther.

Solution. Here the relative error of the given number is

    $E_r \le 0.00005/0.3862 < 0.00013$.

The relative error of the result is therefore less than $4 \times 0.00013$, or
0.00052.

The required number to five figures is $(0.3862)^4 = 0.022246$. Hence
the absolute error of the result is $0.022246 \times 0.00052 = 0.000012$. Since
this error affects the fourth significant figure of the result, the best we
can do is to write

    $(0.3862)^4 = 0.02225$

and say that the last figure is uncertain by one unit.

The relative error of the fourth root of 0.3862 is less than $\tfrac14(0.00013)$
= 0.000032, and since this fourth root is 0.78832 the absolute error of
the result is about $0.78832 \times 0.000032 = 0.000025$. Hence the fourth
root is 0.7883 correct to four figures.
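
The rule for powers and roots reduces to multiplying the relative error by the exponent. A minimal sketch, with a hypothetical function name, applied to the number 0.3862 with a possible error of 0.00005:

```python
def power_relative_error(rel_error: float, m: float) -> float:
    """Relative error of u**m given the relative error of u; m = 1/r for an rth root."""
    return abs(m) * rel_error

u, du = 0.3862, 0.00005
rel = du / u
fourth_power = (u ** 4, power_relative_error(rel, 4))
fourth_root = (u ** 0.25, power_relative_error(rel, 0.25))
print(fourth_power, fourth_root)
```

Raising to the fourth power multiplies the relative error by four, while extracting the fourth root divides it by four, which is why the root retains all four figures and the power does not.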



7h). Logarithms. Here we have

    $N = \log_{10} u = 0.43429 \log_e u$.

Hence

    $\Delta N = 0.43429\,(\Delta u/u)$,

or

    $\Delta N < \tfrac12 (\Delta u/u)$.

The absolute error in the common logarithm of a number is thus less
than half the relative error of the given number.

An error in a logarithm may cause a disastrous error in the antilogarithm
or corresponding number, for from the first formula for $\Delta N$
above we have

    $\Delta u = u\,\Delta N / 0.43429 = 2.3026\,u\,\Delta N$.

The error in the antilog may thus be many times the error in the logarithm.
For this reason it is of the utmost importance that the logarithm
of a result be as free from error as possible.

Example 1. Suppose $N = \log_{10} u = 3.49853$ and $\Delta N < 0.000005$, so
that the given logarithm is correct to its last figure. Then u = 3151.6
and therefore

    $\Delta u = 2.3 \times 3151.6 \times 0.000005 = 0.036$.

Since this error does not affect the fifth figure in u, the antilog is correct
to five figures.

Example 2. Suppose $N = \log_{10} u = 2.96384$ and $\Delta N = 0.00001$. Then
u = 920.11 and

    $\Delta u = 2.3 \times 920.11 \times 0.00001 = 0.021$.

This error affects the fifth figure in u and makes it uncertain by two units.
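
The two antilog examples can be reproduced with the relation between the error of a number and the error of its common logarithm. A minimal sketch; the function name antilog_error is hypothetical, and the natural logarithm of 10 is used in place of the rounded factor 2.3.

```python
import math

def antilog_error(log_value: float, log_error: float):
    """Return (u, du) for u = 10**log_value when the logarithm is off by log_error."""
    u = 10.0 ** log_value
    return u, math.log(10.0) * u * log_error   # du = u * dN / 0.43429...

print(antilog_error(3.49853, 0.000005))
print(antilog_error(2.96384, 0.00001))
```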

Inasmuch as the logarithm of most results is obtained by the addition
of other logarithms, it is evident that such a logarithm is likely to be
in error by a unit in the last figure, due to the addition of rounded
numbers. Hence the corresponding number may frequently be in error
by one or two units in its last significant figure when the number of
significant figures in the antilog is the same as the number of decimals
in the logarithm.

Remarks. The reader should bear in mind the fact that the number
of correct figures in the antilog corresponds to the number of correct
decimals in the logarithm. The integral part, or characteristic, of the
logarithm plays no part in determining the accuracy of the antilog. This
fact is at once evident from a consideration of the equation

    $\Delta u/u = 2.3\,\Delta N$.

For inasmuch as the number of correct figures in the antilog u is measured
by its relative error, and since this latter quantity depends only on
the absolute error $\Delta N$ and not at all on the characteristic, it is plain that
the accuracy of the antilog depends only on the number of correct decimals
in the mantissa.

It is an easy matter to determine the number of correct figures in any
antilog when the number of correct decimals in the mantissa is given.
Suppose, for example, that we are using m-place log tables and that the
possible error in the logarithm of a result is one unit in the last decimal
place, as is usually the case. Then $\Delta N = 1/10^m$ and we have

$$\frac{\Delta u}{u} = \frac{2.3}{10^m} = \frac{2.3}{10 \times 10^{m-1}} = \frac{1}{4.34 \times 10^{m-1}} < \frac{1}{2 \times 10^{m-1}} .$$

Hence by Corollary 4, Art. 5, the antilog u is certainly correct to $m - 1$
significant figures.

The equation $\Delta u/u = 1/(4.34 \times 10^{m-1})$ shows that if the mantissa is
in error by two units in its last figure the antilog is still correct to $m - 1$
significant figures, for in this case the relative error of the antilog is

$$\frac{\Delta u}{u} = \frac{1}{2.17 \times 10^{m-1}} ,$$

which is less than $1/(2 \times 10^{m-1})$. We are therefore justified in asserting
that if the mantissa of a logarithm is not in error by more than two units
in the last decimal place, the antilog is certainly correct to $m - 1$ significant
figures.


8. The Impossibility, in General, of Obtaining a Result More Accurate
than the Data Used. The reader will have observed that in all
the examples worked, no result has been more accurate than the numbers
used in computing it. This, of course, is what
we should have expected, but sometimes computers seem to try to get
more figures in the result than are used in the data. When we apply
Corollaries 1 and 4 of Art. 5 to the errors of products, quotients, powers,
roots, logarithms, and antilogarithms, we find that in no case is the result
true to more figures than are the numbers used in computing it. The
results for these operations are as follows:



(a) Products and Quotients. If $k_1$ and $k_2$ are the first significant
figures of two numbers which are each correct to n significant figures,
and if neither number is of the form $k(1.000\cdots) \times 10^p$, then their
product or quotient is correct to

    $n - 1$ significant figures if $k_1 \ge 2$ and $k_2 \ge 2$,
    $n - 2$ significant figures if either $k_1 = 1$ or $k_2 = 1$.

(b) Powers and Roots. If k is the first significant figure of a number 
which is correct to n significant figures, and if this number contains more 
than one digit different from zero, then its pth power is correct to 

n — 1 significant figures if p ^ k, 
n — 2 significant figures if p ^ 10k ; 

and its rth root is correct to 


n significant figures if rk ^ 10, 
n — 1 significant figures if rk < 10. 


(c) Logs and Antilogs. If k is the first significant figure of a number 
which is correct to n significant figures, and if this number contains 
more than one digit different from zero, then for the absolute error in 
its common logarithm we have 

P ^ L 

“ ^ 4k X 10"-‘ ' 

If a logarithm (to the base 10) is not in error by more than two units 
in the mth decimal place, the antilog is certainly correct to m — 1 signifi- 
cant figures. 

To prove the foregoing results for the accuracy of products and quotients,
let $k_1$ and $k_2$ represent the first significant figures of the given numbers.
Then by Corollary 1 of Art. 5 the relative errors of the numbers are less
than $1/(2k_1 \times 10^{n-1})$ and $1/(2k_2 \times 10^{n-1})$, respectively; and since the
relative error of the product or quotient of two numbers may equal the
sum of their relative errors, we have

$$\text{Relative error of result} < \frac{1}{2k_1 \times 10^{n-1}} + \frac{1}{2k_2 \times 10^{n-1}} = \left(\frac{1}{k_1} + \frac{1}{k_2}\right)\frac{1}{2 \times 10^{n-1}}.$$


Now if $(1/k_1 + 1/k_2) \le 1$ we have $E_r < 1/(2 \times 10^{n-1})$, and the product
or quotient is certainly correct to $n - 1$ significant figures. But this
quantity is not greater than 1 if $k_1 \ge 2$ and $k_2 \ge 2$. Hence in this case
the result is correct to $n - 1$ significant figures. If, however, either $k_1 = 1$
or $k_2 = 1$, the quantity $(1/k_1 + 1/k_2) > 1$ and therefore the relative error
of the result may be greater than $1/(2 \times 10^{n-1})$. Hence the result may
not be correct to $n - 1$ significant figures, but it is certainly correct to
$n - 2$ figures.

To prove the above results for the accuracy of powers and roots let $k$
represent the first significant figure of the given number. Then the relative
error of this number is less than $1/(2k \times 10^{n-1})$. Hence the relative error
of its $p$th power is less than

$$\frac{p}{2k \times 10^{n-1}} = \frac{p}{k} \cdot \frac{1}{2 \times 10^{n-1}}.$$

The result will therefore be correct to $n - 1$ significant figures if $p/k \le 1$,
or $p \le k$, and to $n - 2$ significant figures if $p \le 10k$.

The relative error of the $r$th root is less than

$$\frac{1}{r} \cdot \frac{1}{2k \times 10^{n-1}} = \frac{1}{rk} \cdot \frac{1}{2 \times 10^{n-1}} = \frac{10}{rk} \cdot \frac{1}{2 \times 10^{n}}.$$

Hence the result will be correct to $n$ significant figures if $rk \ge 10$, and to
$n - 1$ significant figures if $rk < 10$.

To prove the result for the error of the common logarithm we recall
that $\Delta N < \tfrac{1}{2}(\Delta u/u)$, and since $\Delta u/u < 1/(2k \times 10^{n-1})$ we have

$$\Delta N < \frac{1}{4k \times 10^{n-1}}.$$


The proof for the accuracy of the antilog has already been given at the 
end of Art. 7. 

Since the separate processes of multiplication, division, raising to powers,
and extraction of roots can not give a result more accurate than the data
used in obtaining it, no combination of these processes could be expected
to give a more accurate result except by accident. Hence when only these
processes are involved in a computation, the result should never be given
to more significant figures than are contained in the least accurate of the
factors used. Even then the last significant figure will usually be uncertain.
In a computation involving several distinct steps, retain at the end of each
step one more significant figure than is required in the final result.

While it is true in general that a computed result is not more accurate
than the numbers used in obtaining it, an exception must be made in the
cases of addition and subtraction. When only these processes are involved,
the result may be much more accurate than one of the quantities added or
subtracted. For example, the sum $3463 + \sqrt{3} = 3463 + 1.7 = 3464.7$ is
correct to five significant figures (assuming 3463 to be an exact number)
even though one of the numbers used in obtaining it is correct to only two
figures. A similar result would evidently follow in the case of subtraction.

9. Further Considerations on the Accuracy of a Computed Result. 
In commenting on formulas (7.1) and (7.3), it was stated that the 
absolute error of a sum is equal to the algebraic sum of the errors of the 
numbers added, and that the relative error of a product is equal to the 
algebraic sum of the relative errors of the factors. The word “ algebraic ” 
deserves emphasis in these cases because errors of measurement and errors
due to rounding are compensating to a very great extent, so that in most 
cases the error in a computed result is not equal to the arithmetical sum 
of the errors of the numbers from which the result was obtained. 

We saw in (7b) that the error in a sum was only a small fraction of 
the arithmetic sum of the separate errors. That the errors of the factors 
in a product are also compensating may be seen by considering the product 
of two exact numbers: 


649.3 X 675.8 = 438,796.94. 


Now suppose we round off these numbers to 649 and 676. Their product 
is then 649 X 676 = 438,724. The actual error of this product is 72.94,
and the relative error is

$$\frac{72.94}{438{,}796.94} = 0.000166.$$

The relative errors of the factors are 0.3/649.3 = 0.000462 and
-0.2/675.8 = -0.000296. The relative error of the product is thus less than
the relative error of either factor and is actually equal to their algebraic sum.
The product in this case is more accurate than either factor.
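
The compensation of errors in this product can be checked with a short Python sketch. The variable names are illustrative only and do not come from the text; the figures are those of the example above.

```python
# Exact factors and their rounded values, as in the example above.
a, b = 649.3, 675.8
a_rounded, b_rounded = 649.0, 676.0

exact = a * b
approx = a_rounded * b_rounded

rel_a = (a - a_rounded) / a            # relative error of the first factor
rel_b = (b - b_rounded) / b            # relative error of the second factor (negative)
rel_prod = (exact - approx) / exact    # relative error of the product

print(f"factor errors: {rel_a:.6f}, {rel_b:.6f}")
print(f"algebraic sum: {rel_a + rel_b:.6f}")
print(f"product error: {rel_prod:.6f}")
```

The product error and the algebraic sum agree to first order; the small difference is the product of the two relative errors, which the first-order formula neglects.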

When a long computation is carried out in several steps and the
intermediate results are properly rounded at the end of each step, there is no
accumulation of rounding errors. If there were, long astronomical
computations, such as those of eclipses and the orbits of comets, would be
worthless. Time and experience have proved the correctness of such
astronomical computations. In a chain computation the loss of significant
figures by subtraction is the chief source of error.

Bad advice is sometimes given in regard to computation. In the addition
of numbers of unequal accuracy, some writers advise that all the
numbers first be rounded off to the number of decimal places given in
the least accurate number. When this is done, the computer throws away
definite information and replaces it with uncertainty. In adding a column
of several numbers, the uncertainties might largely cancel one another,
but this would not be the case with only a few numbers. The proper
method is to add the more accurate numbers separately and then round
off their sum to the same decimal place as the least accurate number or
numbers. In this way, the sum is as accurate as the least accurate of the
numbers added.

Similar bad advice is given in the case of multiplication and division. 
When multiplying or dividing numbers of unequal accuracy, some writers 
advise that all numbers first be rounded off to the same number of signifi- 
cant figures as contained in the least accurate factor. To make all factors 
as rough as the roughest one is folly. There is no sense in throwing 
away perfectly definite information and replacing it with a question mark. 
The more accurate factors should be kept with one more significant figure 
than the least accurate factor. Then the result will usually be as accurate 
as the least accurate factor. The correct procedure in all ordinary com- 
putations can be stated in 

A Sound and Safe Rule : When computing with rounded or approximate 
numbers of unequal accuracy, retain from the beginning one more signifi- 
cant figure in the more accurate numbers than are contained in the least 
accurate number. Then round off the result to the same number of 
significant figures as the least accurate number. 

In the case of addition, retain in the more accurate numbers one more 
decimal digit than is contained in the least accurate number. 

This rule follows from equations (7.1), (7.3), and (7.4). By retaining
one more digit in the more accurate numbers, we reduce to zero the errors 
of those terms and thus reduce the error of the final result. 

In the case of subtraction or of addition of only two numbers, round off 
the more accurate number to the same number of decimal places as the 
less accurate one before subtracting or adding. 

10. Accuracy in the Evaluation of a Formula or Complex Expression.
The two fundamental problems under this head are the following: 

(a) Given the errors of several independent quantities or approximate 
numbers, to find the error of any function of these quantities. 

(b) To find the allowable errors in several independent quantities in 
order to obtain a prescribed degree of accuracy in any function of these 
quantities. 

10a). The Direct Problem. The first of these problems is solved by
replacing the given approximate numbers by the letters $a, b, c, \cdots$ or
$u_1, u_2, u_3, \cdots$, taking the partial derivatives of the function with respect to
each of these letters, and then substituting in formula (6.1) or (6.2).
An exact number, such as 2, 3, 10, etc., is not replaced by a letter before
taking the derivatives.* We shall now work some examples to show the
method of procedure.


Example 1. Find the error in the evaluation of the fraction
$\cos 7^\circ 10' / \log_{10} 242.7$, assuming that the angle may be in error by $1'$ and
that the number 242.7 may be in error by a unit in its last figure.

Solution. Since this is a quotient of two functions, it is better to
compute the relative error from the formula $E_r \le \Delta u_1/u_1 + \Delta u_2/u_2$ and
then find the absolute error from the relation $E_a = N E_r$. Hence if we
write

$$N = \frac{\cos 7^\circ 10'}{\log_{10} 242.7} = \frac{\cos x}{\log_{10} y} = \frac{u_1}{u_2},$$

we have

$$\Delta u_1 = \Delta \cos x = -\sin x\,\Delta x,$$

$$\Delta u_2 = \Delta \log_{10} y = 0.43429\,(\Delta y / y).$$

$$\therefore\ E_r \le \frac{\sin x}{\cos x}\,\Delta x + \frac{0.43429}{y \log_{10} y}\,\Delta y,$$

or

$$E_r < \tan x\,\Delta x + \frac{0.435}{y \log_{10} y}\,\Delta y.$$

Now taking $x = 7^\circ 10'$, $\Delta x = 1' = 0.000291$ radian, $y = 242$, $\Delta y = 0.1$,
and using a slide rule for the computation, we have

$$E_r < 0.126 \times 0.000291 + \frac{0.435 \times 0.1}{242 \log_{10} 242} = 0.00011.$$

Since $N = \cos 7^\circ 10' / \log_{10} 242.7 = 0.41599$, we have

$$E_a = 0.00011 \times 0.416 = 0.000046,$$

or $E_a < 0.00005$.

The value of the fraction is therefore between 0.41604 and 0.41594,
and we take the mean of these numbers to four figures as the best value
of the fraction, or

$$N = 0.4160.$$
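
As a check on Example 1, the following Python sketch evaluates the same first-order formula in floating point instead of by slide rule. The variable names are illustrative assumptions; the formula is the one derived in the solution, with 0.43429 as the modulus of common logarithms.

```python
import math

# Data of Example 1: angle 7 deg 10 min with possible error 1 min,
# and the number 242.7 with possible error 0.1.
x = math.radians(7 + 10 / 60)
dx = math.radians(1 / 60)
y, dy = 242.7, 0.1

N = math.cos(x) / math.log10(y)

# Relative error of the quotient: sum of the relative errors of
# numerator (tan x * dx) and denominator (0.43429 * dy / (y log10 y)).
rel_err = math.tan(x) * dx + 0.43429 * dy / (y * math.log10(y))
abs_err = N * rel_err

print(f"N = {N:.5f}, E_r = {rel_err:.6f}, E_a = {abs_err:.6f}")
```

The absolute error stays below 0.00005, which is why only four figures of the quotient are retained.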


* Adopted or accepted values of physical, chemical, and astronomical constants are
to be treated as exact numbers, but results obtained by using these numbers as
multipliers or divisors are not to be relied upon to more significant figures than are
used in the constants themselves.

Example 2. The hypotenuse and a side of a right triangle are found
by measurement to be 75 and 32, respectively. If the possible error in
the hypotenuse is 0.2 and that in the side is 0.1, find the possible error
in the computed angle $A$.

Solution. Lettering the triangle in the usual manner, we have

$$\sin A = 32/75 = a/c,$$

$$A = \sin^{-1}(a/c),$$

and

$$\Delta A = (\partial A/\partial a)\,\Delta a + (\partial A/\partial c)\,\Delta c.$$

Now

$$\frac{\partial A}{\partial a} = \frac{1}{\sqrt{c^2 - a^2}}, \qquad \frac{\partial A}{\partial c} = -\frac{a}{c\sqrt{c^2 - a^2}}.$$

Taking the numerical values of $c$ and $a$ in such a manner as to give the
upper limits for the magnitudes of $\partial A/\partial a$ and $\partial A/\partial c$, and remembering
that $\Delta a = 0.1$, $\Delta c = 0.2$, we have

$$\Delta A < \frac{1}{\sqrt{(74.8)^2 - (32.1)^2}} \times 0.1 + \frac{32.1}{74.8\sqrt{(74.8)^2 - (32.1)^2}} \times 0.2 = 0.00276,$$

or

$$\Delta A < 0.0028 \text{ radian} = 9'\,38''.$$

The possible error in $A$ is therefore less than $9'\,38''$.
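
The bound of Example 2 can be reproduced with the sketch below. It assumes, as the solution does, that the unfavorable values $a + \Delta a$ and $c - \Delta c$ are used in the derivatives and that the two terms are added in absolute value; the variable names are illustrative.

```python
import math

a, c = 32.0, 75.0        # measured side and hypotenuse
da, dc = 0.1, 0.2        # possible errors of the measurements

# Unfavorable values: largest side, smallest hypotenuse.
a_w, c_w = a + da, c - dc
root = math.sqrt(c_w**2 - a_w**2)

# |dA/da| * da + |dA/dc| * dc
dA = da / root + a_w / (c_w * root) * dc
dA_arcsec = math.degrees(dA) * 3600

print(f"dA = {dA:.5f} rad = {dA_arcsec:.0f} seconds of arc")
```

Evaluated in floating point the sum comes out a little under the rounded bound of 0.0028 radian quoted in the solution.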

10b). The Inverse Problem. We now turn our attention to the second
fundamental problem mentioned at the beginning of this article: that
of finding the allowable errors in $u_1, u_2, \cdots, u_n$ when the function $N$ is
desired to a given degree of accuracy. This problem is mathematically
indeterminate, since it would be possible to choose the errors $\Delta u_1$, $\Delta u_2$, etc.
in a variety of ways so as to make $\Delta N$ less than any prescribed quantity.
The problem is solved with the least labor by using what is known as
the principle of equal effects.* This principle assumes that all the partial
differentials $(\partial N/\partial u_1)\Delta u_1$, $(\partial N/\partial u_2)\Delta u_2$, etc., contribute an equal amount
in making up the total error $\Delta N$. Under these conditions all the terms
in the right-hand member of equation (6.1) are equal to one another,
so that

$$\Delta N = n\,\frac{\partial N}{\partial u_1}\,\Delta u_1 = n\,\frac{\partial N}{\partial u_2}\,\Delta u_2 = \cdots = n\,\frac{\partial N}{\partial u_n}\,\Delta u_n.$$

Hence

$$\Delta u_1 = \frac{\Delta N}{n\,\dfrac{\partial N}{\partial u_1}}, \qquad \Delta u_2 = \frac{\Delta N}{n\,\dfrac{\partial N}{\partial u_2}}, \qquad \cdots, \qquad \Delta u_n = \frac{\Delta N}{n\,\dfrac{\partial N}{\partial u_n}}.$$

* See Palmer's Theory of Measurements, pp. 147-148.





Example 3. Two sides and the included angle of a triangular city lot
are approximately 96 ft., 87 ft., and 36°, respectively. Find the allowable
errors in these quantities in order that the area of the lot may be
determined to the nearest square foot.

Solution. Writing $b = 96$, $c = 87$, $A = 36^\circ$, and denoting the area
by $u$, we have

$$u = \tfrac{1}{2} bc \sin A = \tfrac{1}{2}(96 \times 87 \sin 36^\circ) = 2455 \text{ sq. ft.}$$

Hence

$$\frac{\partial u}{\partial b} = \tfrac{1}{2} c \sin A, \qquad \frac{\partial u}{\partial c} = \tfrac{1}{2} b \sin A, \qquad \frac{\partial u}{\partial A} = \tfrac{1}{2} bc \cos A.$$

Substituting these quantities in (6.2), we find

$$\frac{\Delta u}{u} = \frac{\Delta b}{b} + \frac{\Delta c}{c} + \frac{\Delta A}{\tan A}.$$

Now since the area is to be determined to the nearest square foot we
must have $\Delta u < 0.5$; and by the principle of equal effects we must have

$$\frac{\Delta b}{b} = \frac{\Delta u}{3u} < \frac{0.5}{3 \times 2455} = \frac{1}{14730} < 0.000068.$$

Hence $\Delta b < 96 \times 0.000068 = 0.0065$ ft.
In like manner,

$$\frac{\Delta c}{c} = \frac{\Delta u}{3u} < 0.000068, \quad \text{or} \quad \Delta c < 87 \times 0.000068 = 0.0059 \text{ ft.};$$

and

$$\frac{\Delta A}{\tan A} = \frac{\Delta u}{3u} < 0.000068, \quad \text{or} \quad \Delta A < 0.000068 \tan 36^\circ = 0.000049 \text{ radian}.$$

Hence from a table for converting radians to degrees we find $\Delta A = 10''$.

It thus appears that in order to attain the desired accuracy in the area
the sides must be measured to the nearest hundredth of a foot and the
included angle to the nearest $20''$ of arc.

This problem could also be solved by assuming that the possible errors
in the measured sides might be 0.005 ft. and then computing the
permissible error in the measured angle.
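
The equal-effects allotment of Example 3 can be sketched in Python as follows. The names are illustrative assumptions; each of the three relative contributions is given one third of the permitted relative error in the area.

```python
import math

b, c = 96.0, 87.0                 # sides in feet
A = math.radians(36.0)            # included angle
u = 0.5 * b * c * math.sin(A)     # area in square feet

du = 0.5                          # area wanted to the nearest square foot
share = du / (3 * u)              # equal share of the relative error

db = b * share                    # allowable error in b, feet
dc = c * share                    # allowable error in c, feet
dA = math.tan(A) * share          # allowable error in A, radians
dA_arcsec = math.degrees(dA) * 3600

print(f"u = {u:.0f} sq. ft.")
print(f"db = {db:.4f} ft., dc = {dc:.4f} ft., dA = {dA_arcsec:.1f} seconds")
```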

Example 4. The value of the function $6x^2(\log_{10} x - \sin 2y)$ is required
correct to two decimal places. If the approximate values of $x$ and $y$ are
15.2 and 57°, respectively, find the permissible errors in these quantities.

Solution. Putting

$$u = 6x^2(\log_{10} x - \sin 2y) = 6(15.2)^2(\log_{10} 15.2 - \sin 114^\circ) = 371.9,$$

we have

$$\frac{\partial u}{\partial x} = 12x(\log_{10} x - \sin 2y) + 6x \times 0.43429 = 88.54,$$

$$\frac{\partial u}{\partial y} = -12x^2 \cos 2y = 1127.7.$$

Hence

$$\Delta u = \frac{\partial u}{\partial x}\,\Delta x + \frac{\partial u}{\partial y}\,\Delta y = 88.54\,\Delta x + 1127.7\,\Delta y.$$

In order that the required result be correct to two decimal places we
must have $\Delta u < 0.005$. Then by the principle of equal effects we have

$$\Delta x = \frac{\Delta u}{2\,\dfrac{\partial u}{\partial x}} < \frac{0.005}{2 \times 88.54} = 0.000028,$$

$$\Delta y = \frac{\Delta u}{2\,\dfrac{\partial u}{\partial y}} < \frac{0.005}{2 \times 1127.7} = 0.0000022 \text{ rad.} = 0''.45.$$

Since the permissible error in $x$ is only 0.00003, it will be necessary to
take $x$ to seven significant figures in order to attain the required degree
of accuracy in the result. The value of $y$ can then be taken to the nearest
second.

The reason why the permissible errors in $x$ and $y$ are so small in this
example is that the factor $\log_{10} x - \sin 2y$ causes the loss of one significant
figure by subtraction.
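
For functions whose partial derivatives are tedious to write out, the principle of equal effects can be applied with numerical derivatives. The sketch below is one possible arrangement, not a procedure given in the text: `partial` and `equal_effects` are hypothetical helper names, and the central-difference step `h` is an arbitrary choice.

```python
import math

def u(x, y):
    """The function of Example 4; y is in radians."""
    return 6 * x**2 * (math.log10(x) - math.sin(2 * y))

def partial(f, args, i, h=1e-6):
    """Central-difference estimate of the i-th partial derivative of f."""
    lo, hi = list(args), list(args)
    lo[i] -= h
    hi[i] += h
    return (f(*hi) - f(*lo)) / (2 * h)

def equal_effects(f, args, total_error):
    """Allowable error in each argument when all contribute equally."""
    n = len(args)
    return [total_error / (n * abs(partial(f, args, i))) for i in range(n)]

x0, y0 = 15.2, math.radians(57.0)
dx, dy = equal_effects(u, (x0, y0), 0.005)

print(f"u = {u(x0, y0):.1f}, dx = {dx:.6f}, dy = {dy:.7f} rad")
```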

Remark. It is neither necessary nor desirable to investigate the accuracy 
of all proposed computations. But when we are in doubt about the possi- 
bility of attaining a certain degree of accuracy in the final result, we 
should make the necessary investigation. It usually suffices to carry all 
computations to one more figure than is desired in the final result and 
then round off the result to the desired number of figures, if the accuracy 
of the given independent quantities is such as to permit this. 


11. Accuracy in the Determination of Arguments from a Tabulated 
Function. In many problems it is necessary to compute some function 
of an unknown quantity and then determine the quantity from tabulated
values of the function. Examples of this kind are the determination of
numbers from a table of logarithms, and angles from trigonometric tables. 
If the computed function happens to be affected with an error, the argu- 
ment determined from this function is necessarily incorrect in some degree. 
The purpose of this article is to investigate the accuracy of the argument 
whose value is required. 





In tables of single entry are tabulated functions of a single argument.
Calling $x$ the argument and $y$ the tabulated function, we have

$$y = f(x).$$

From this we get the relation

$$\Delta y = f'(x)\,\Delta x \quad \text{approximately},$$

from which we have

(11.1) $$\Delta x = \Delta y / f'(x).$$

This is the fundamental equation for computing the error in arguments
taken from a table. Here $\Delta y$ represents the error in the computed function
whose values are tabulated, and $\Delta x$ is the corresponding error in the
argument. It will be noted that the magnitude of $\Delta x$ depends upon three
things: the error in the function, the nature of the function, and the
magnitude of the argument itself. We shall now apply (11.1) to several
functions whose values are tabulated.
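
Equation (11.1) lends itself to a small numerical sketch. The helper name `argument_error` and the sample values (an error of half a unit in the fifth decimal of the function, $x = 2$ for the common logarithm, 30° for the sine) are assumptions chosen only for illustration.

```python
import math

def argument_error(dy, fprime_at_x):
    """Error in the argument from (11.1): dx = dy / f'(x)."""
    return abs(dy / fprime_at_x)

M = 0.43429448          # modulus of common logarithms
dy = 0.5e-5             # assumed error in the tabulated function value

# Common logarithm at x = 2: f'(x) = M / x.
x = 2.0
dx_log = argument_error(dy, M / x)

# Sine at 30 degrees: f'(x) = cos x; result converted to seconds of arc.
theta = math.radians(30.0)
dx_sin = argument_error(dy, math.cos(theta))
dx_sin_arcsec = dx_sin * 206264.8

print(f"dx (log10) = {dx_log:.3e}")
print(f"dx (sine)  = {dx_sin_arcsec:.2f} seconds of arc")
```

The same helper serves for any tabulated function once its derivative at the argument is known.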

1. Logarithms.

(a) $f(x) = \log_e x$.

$f'(x) = 1/x$.

(1) $\therefore\ \Delta x = x\,\Delta y$, from (11.1).

(b) $f(x) = \log_{10} x$.

$f'(x) = M/x$, where $M = 0.43429$.

$\therefore\ \Delta x = x\,\Delta y / M = 2.3026\,x\,\Delta y$.

Hence

(2) $\Delta x < 2.31\,x\,\Delta y$.

2. Trigonometric Functions.

(a) $f(x) = \sin x$.

$f'(x) = \cos x$.

(3) $\therefore\ \Delta x = \Delta y / \cos x = \sec x\,\Delta y$ radians,

or

(4) $(\Delta x)'' = 206264.8 \sec x\,\Delta y$ seconds.

(b) $f(x) = \tan x$.

$f'(x) = \sec^2 x$.





result found by means of tables is rather involved. The reader will find
a masterly treatment of this matter in J. Lüroth's Vorlesungen über
numerisches Rechnen, Leipzig, 1900.

However, the problem is of little practical importance, because the errors 
in such a computation rarely if ever combine so as to produce their 
maximum aggregate effect. They neutralize one another as the calcu- 
lation proceeds. 

12. The Accuracy of Series Approximations. It is frequently easier 
to find the numerical value of a function by expanding it into a power series 
and evaluating the first few terms than by any other method. In fact, this 
is sometimes the only possible method of computing it. The general 
method for expanding functions into power series is by means of Taylor’s 
formula. The two standard forms of this formula are the following: 

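(1) $$f(x) = f(a) + (x-a)f'(a) + \frac{(x-a)^2}{2!}f''(a) + \cdots + \frac{(x-a)^{n-1}}{(n-1)!}f^{(n-1)}(a) + \frac{(x-a)^n}{n!}f^{(n)}[a + \theta(x-a)], \quad 0 < \theta < 1.$$

(2) $$f(x+h) = f(x) + hf'(x) + \frac{h^2}{2!}f''(x) + \cdots + \frac{h^{n-1}}{(n-1)!}f^{(n-1)}(x) + \frac{h^n}{n!}f^{(n)}(x + \theta h), \quad 0 < \theta < 1.$$

On putting $a = 0$ in (1) we get Maclaurin's formula:

(3) $$f(x) = f(0) + xf'(0) + \frac{x^2}{2!}f''(0) + \cdots + \frac{x^{n-1}}{(n-1)!}f^{(n-1)}(0) + \frac{x^n}{n!}f^{(n)}(\theta x), \quad 0 < \theta < 1.$$

The last term in each of these three formulas is the remainder after
$n$ terms. This remainder term is the quantity in which we shall be
interested in this article. The forms of the remainder given above are
not the only ones, however. Another useful form will be given below.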

12a). The Remainder Terms in Taylor's and Maclaurin's Series.
Denoting by $R_n(x)$ the remainder after $n$ terms in the Taylor and Maclaurin
expansions, we have the following useful forms:

1. For Taylor's formula (1):

(a) $$R_n(x) = \frac{(x-a)^n}{n!}f^{(n)}[a + \theta(x-a)], \quad 0 < \theta < 1.$$

(b) $$R_n(x) = \frac{1}{(n-1)!}\int_0^{x-a} f^{(n)}(x-t)\,t^{n-1}\,dt.$$

2. For Taylor's formula (2):

(a) $$R_n(x) = \frac{h^n}{n!}f^{(n)}(x + \theta h), \quad 0 < \theta < 1.$$

(b) $$R_n(x) = \frac{1}{(n-1)!}\int_0^{h} f^{(n)}(x+h-t)\,t^{n-1}\,dt.$$

3. For Maclaurin's formula:

(a) $$R_n(x) = \frac{x^n}{n!}f^{(n)}(\theta x), \quad 0 < \theta < 1.$$

(b) $$R_n(x) = \frac{1}{(n-1)!}\int_0^{x} f^{(n)}(x-t)\,t^{n-1}\,dt.$$

It will be observed that the second form (the integral form) is perfectly
definite and contains no uncertain factor $\theta$. In using either form, however,
it is necessary first to find the $n$th derivative of $f(x)$.

Since the integral form of $R_n(x)$ is not usually given in the textbooks
on calculus, we shall show how to apply it to an example.

Example. Find the remainder after $n$ terms in the expansion of
$\log_e(x + h)$.

Solution. Here

$$f(x) = \log_e x, \quad f'(x) = \frac{1}{x}, \quad f''(x) = -\frac{1}{x^2}, \quad f'''(x) = \frac{2}{x^3}, \quad f^{iv}(x) = -\frac{6}{x^4}, \quad \cdots,$$

$$f^{(n)}(x) = (-1)^{n-1}\frac{(n-1)!}{x^n}.$$

Hence, by the integral form 2(b),

$$R_n(x) = \frac{(-1)^{n-1}(n-1)!}{(n-1)!}\int_0^h \frac{t^{n-1}}{(x+h-t)^n}\,dt = (-1)^{n-1}\int_0^h \frac{t^{n-1}}{(x+h-t)^n}\,dt.$$

Now since $t$ varies from 0 to $h$, the greatest value of $R_n(x)$ is obtained
by putting $t = h$ in the denominator of the integrand. We then have, omitting
the factor $(-1)^{n-1}$, which is never greater than 1 in absolute value,

$$R_n(x) < \int_0^h \frac{t^{n-1}}{x^n}\,dt = \frac{1}{x^n}\int_0^h t^{n-1}\,dt = \frac{1}{n}\left(\frac{h}{x}\right)^n.$$

Suppose $x = 1$, $h = 0.01$. Then $h/x = 0.01$. If, therefore, we wish to
know how many terms in the expansion of $\log_e 1.01$ are necessary in order to
get a result correct to seven decimal places we take $R_n \le 0.00000005$.

$$\therefore\ (1/n)(0.01)^n \le 0.00000005.$$

It is evident by inspection that $n = 4$ will give a remainder much smaller
than the allowable error. Hence we take four terms of the expansion of
$\log_e(x + h)$.

The reader can easily verify that the first form of remainder gives the
same result as that just found.
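
The bound $(1/n)(h/x)^n$ can be tried numerically. In the sketch below the function names are illustrative, and "n terms" is counted as in the example, the first term being $\log_e x$ itself.

```python
import math

def remainder_bound(n, x, h):
    """Upper limit (1/n)(h/x)^n for the remainder after n terms."""
    return (h / x) ** n / n

def taylor_log(x, h, n):
    """First n terms of log(x + h) = log x + h/x - h^2/(2x^2) + ..."""
    total = math.log(x)
    for k in range(1, n):
        total += (-1) ** (k - 1) * (h / x) ** k / k
    return total

x, h = 1.0, 0.01
n = 1
while remainder_bound(n, x, h) > 0.5e-7:   # seven correct decimals wanted
    n += 1

actual_error = abs(math.log(x + h) - taylor_log(x, h, n))
print(n, remainder_bound(n, x, h), actual_error)
```

The loop stops at the first $n$ whose bound falls below the allowable error, and the actual error of the truncated expansion lies under that bound.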


12b). Alternating Series. An alternating series is an infinite series in
which the terms are alternately positive and negative. Such a series is 
convergent if (a) each term is numerically less than the preceding and 
(b) the limit of the nth term is zero when n becomes infinite. 

Alternating series are of frequent occurrence in applied mathematics 
and are the most satisfactory for purposes of computation, because it is 
always an easy matter to determine the error of a computed result. The 
rule for determining the error is simply this : 

In a convergent alternating series the error committed in stopping with 
any term is always less than the first term neglected. 


Thus, since

$$\log_e(1 + x) = x - x^2/2 + x^3/3 - x^4/4 + x^5/5 - \cdots,$$

we have

$$\log_e(1.01) = 0.01 - \frac{(0.01)^2}{2} + \frac{(0.01)^3}{3} + R,$$

where $R < |(0.01)^4/4| = 0.0000000025$.

We therefore get a result true to eight decimal places by taking only 
three terms of the expansion. 
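
The rule for alternating series can be checked on this example with a few lines of Python. The function name `log1p_series` is an illustrative choice; `math.log1p` supplies the reference value.

```python
import math

def log1p_series(x, terms):
    """Partial sum x - x^2/2 + x^3/3 - ... with the given number of terms."""
    return sum((-1) ** (k - 1) * x**k / k for k in range(1, terms + 1))

x = 0.01
approx = log1p_series(x, 3)
first_neglected = x**4 / 4
error = abs(math.log1p(x) - approx)

print(f"approx = {approx:.10f}")
print(f"error  = {error:.3e}, first neglected term = {first_neglected:.3e}")
```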

12c). Some Important Series and Their Remainder Terms. Below are
given some of the most useful series and their remainder terms, alternating
series not being included because their remainder terms can be computed
by the rule given above.

1. The Binomial Series.

$$(1+x)^m = 1 + mx + \frac{m(m-1)}{2!}x^2 + \frac{m(m-1)(m-2)}{3!}x^3 + \cdots + \frac{m(m-1)(m-2)\cdots(m-n+2)}{(n-1)!}x^{n-1} + R_n,$$




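where

(a) $$R_n = \frac{m(m-1)(m-2)\cdots(m-n+1)}{n!}\,x^n(1+\theta x)^{m-n}, \quad 0 < \theta < 1,$$ in all cases.

(b) $$|R_n| \le \left|\frac{m(m-1)(m-2)\cdots(m-n+1)}{n!}\,x^n\right|$$ if $x > 0$ and $n > m$.

(c) $$|R_n| < \left|\frac{m(m-1)(m-2)\cdots(m-n+1)}{n!}\cdot\frac{x^n}{(1+x)^{n-m}}\right|$$ if $x < 0$ and $n > m$.

(d) $$|R_n| < |x^n|\,(1+x)^m$$ if $-1 < m < 0$.

If $m$ is a fraction, positive or negative, or a negative integer, the binomial
expansion is valid only when $|x| < 1$. Also, except when $m$ is a positive
integer, a binomial such as $(a + b)^m$ must be written in the form

$$a^m\left(1 + \frac{b}{a}\right)^m \text{ if } a > b, \quad \text{or} \quad b^m\left(1 + \frac{a}{b}\right)^m \text{ if } b > a,$$

before expanding it.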

2. Exponential Series.

(a) $$e^x = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots + \frac{x^{n-1}}{(n-1)!} + \frac{x^n}{n!}e^{\theta x}, \quad 0 < \theta < 1.$$

If in (a) we put $x = 1$ we get the following series for computing $e$:

(c) $$e = 1 + 1 + \frac{1}{2!} + \frac{1}{3!} + \frac{1}{4!} + \cdots + \frac{1}{(n-1)!} + R_n.$$

Here

$$R_n = \frac{e^{\theta}}{n!}.$$

But since $e < 3$ and $\theta < 1$, it is plain that

(d) $$R_n < \frac{3}{n!}.$$

A more definite formula for $R_n$ can be found as follows:
Writing more than $n$ terms of the series (c), we have

$$e = 1 + 1 + \frac{1}{2!} + \cdots + \frac{1}{(n-1)!} + \frac{1}{n!} + \frac{1}{(n+1)!} + \frac{1}{(n+2)!} + \cdots,$$

where the remainder after $n$ terms is

$$R_n = \frac{1}{n!} + \frac{1}{(n+1)!} + \frac{1}{(n+2)!} + \cdots = \frac{1}{n!}\left(1 + \frac{1}{n+1} + \frac{1}{(n+1)(n+2)} + \cdots\right).$$

The quantity in parenthesis on the right is clearly less than the sum of
the geometric series

$$1 + 1/n + 1/n^2 + 1/n^3 + \cdots,$$

the sum of which is

$$\frac{1}{1 - 1/n} = \frac{n}{n-1}.$$

Hence

(e) $$R_n < \frac{1}{n!}\cdot\frac{n}{n-1} = \frac{1}{(n-1)(n-1)!}.$$

By means of this formula (e) we can find the requisite number of
terms in the expansion (c) to give the value of $e$ correct to any desired
number of decimal places. Thus, if we wished to find $e$ correct to ten
decimal places by means of the series (c) we would find $n$ from the
equation $1/[(n-1)(n-1)!] = 0.00000000005$. With the aid of a table of
the reciprocals of the factorials we find that $n - 1 = 13$, or $n = 14$. We
should therefore take 14 terms of the series (c). We find in like manner
that in order to compute $e$ correct to 100 decimal places we should take
71 terms of the series (c).
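
In place of a table of reciprocals of factorials, formula (e) can be searched directly with exact rational arithmetic. This is a sketch only; `terms_needed` is a hypothetical name, and "correct to d decimal places" is taken to mean a remainder not exceeding half a unit in the d-th place.

```python
from fractions import Fraction
from math import factorial

def terms_needed(decimals):
    """Smallest n for which 1/((n-1)(n-1)!) is at most 5 * 10^-(decimals+1)."""
    tolerance = Fraction(5, 10 ** (decimals + 1))
    n = 2
    while Fraction(1, (n - 1) * factorial(n - 1)) > tolerance:
        n += 1
    return n

print(terms_needed(10), terms_needed(100))
```

Exact fractions are used so that the comparison is not disturbed by floating-point underflow at 100 decimal places.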


3. Logarithmic Series.

(a) $$\log_e(m+1) = \log_e m + 2\left[\frac{1}{2m+1} + \frac{1}{3(2m+1)^3} + \frac{1}{5(2m+1)^5} + \cdots + \frac{1}{(2n-1)(2m+1)^{2n-1}}\right] + R_n.$$

To find an upper limit for $R_n$ we have

$$R_n = 2\left[\frac{1}{(2n+1)(2m+1)^{2n+1}} + \frac{1}{(2n+3)(2m+1)^{2n+3}} + \frac{1}{(2n+5)(2m+1)^{2n+5}} + \cdots\right].$$

Each term of the series in brackets, after the first, is less than the
corresponding term of the series

$$\frac{1}{(2n+1)(2m+1)^{2n+1}} + \frac{1}{(2n+1)(2m+1)^{2n+3}} + \frac{1}{(2n+1)(2m+1)^{2n+5}} + \cdots,$$

or

$$\frac{1}{(2n+1)(2m+1)^{2n+1}}\left[1 + \frac{1}{(2m+1)^2} + \frac{1}{(2m+1)^4} + \cdots\right],$$

in which the bracketed quantity is a geometric series with ratio $1/(2m+1)^2$ and sum

$$\frac{1}{1 - \dfrac{1}{(2m+1)^2}} = \frac{(2m+1)^2}{4m(m+1)}.$$

Therefore

$$R_n < 2\left(\frac{1}{(2n+1)(2m+1)^{2n+1}}\right)\frac{(2m+1)^2}{4m(m+1)},$$

or

(b) $$R_n < \frac{1}{2m(m+1)(2n+1)(2m+1)^{2n-1}}.$$

Example 1. To compute ln 2 * by taking three terms of (a) we have,
since $m = 1$, $n = 3$,

$$\ln 2 = 2\left[\frac{1}{3} + \frac{1}{3 \cdot 3^3} + \frac{1}{5 \cdot 3^5}\right] = 0.693004,$$

and by (b),

$$R_3 < \frac{1}{2 \cdot 1 \cdot 2 \cdot 7 \cdot 3^5} = 0.000147,$$

which affects the fourth decimal place. Since the true value of ln 2 to
eight decimal places is 0.69314718, the error in the value found above is
0.000143, which is less than 0.000147.

Example 2. To find ln 5 correct to ten decimal places we have $m = 4$,
$R_n < (1/2) \cdot (1/10^{10})$. Hence, by (b),

$$\frac{1}{2 \cdot 4 \times 5\,(2n+1)(9)^{2n-1}} = \frac{1}{2}\cdot\frac{1}{10^{10}},$$

or

$$(2n+1)(9)^{2n-1} = 5 \times 10^8 = 500{,}000{,}000.$$

We find by trial that $n$ is about 4.5, and that for $n = 5$ the logarithm will
be correct to 11 decimal places.

* Frequently in this book we shall write ln for $\log_e$.
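
Series (a) and bound (b) of the logarithmic series can be evaluated together. The function name `log_step` is an illustrative assumption; it returns the $n$-term value of $\ln(m+1) - \ln m$ and the corresponding upper limit for $R_n$.

```python
import math

def log_step(m, n):
    """n-term value of ln(m+1) - ln(m) by series (a), with bound (b) on R_n."""
    q = 2 * m + 1
    series = 2 * sum(1 / ((2 * k - 1) * q ** (2 * k - 1)) for k in range(1, n + 1))
    bound = 1 / (2 * m * (m + 1) * (2 * n + 1) * q ** (2 * n - 1))
    return series, bound

approx, bound = log_step(1, 3)      # ln 2 from three terms, since ln 1 = 0
error = math.log(2) - approx

print(f"ln 2 ~ {approx:.6f}, error = {error:.6f}, bound = {bound:.6f}")
```

For Example 1 the actual error falls just under the bound, which shows that (b) is a fairly close estimate when $m$ is small.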





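12d). Some nth Derivatives. In computing the remainder term in a
series it is necessary to have the $n$th derivative of the given function.
To facilitate the calculation of $R_n$ we therefore give below a list of $n$th
derivatives of some simple functions. The symbol $D$ denotes
differentiation with respect to $x$, or $D = d/dx$.

(a) $$D^n a^x = a^x(\log_e a)^n.$$

(b) $$D^n \sin x = \sin[x + n(\pi/2)].$$

(c) $$D^n \cos x = \cos[x + n(\pi/2)].$$

(d) $$D^n\left(\frac{1}{a + bx}\right) = \frac{(-1)^n\,n!\,b^n}{(a + bx)^{n+1}}.$$

(e) $$D^n\left(\frac{1}{\sqrt{a + bx}}\right) = \frac{(-1)^n \cdot 1 \cdot 3 \cdot 5 \cdots (2n-1)\,b^n}{2^n (a + bx)^{(2n+1)/2}}.$$

(f) $$D^n \log_e(a + bx) = \frac{(-1)^{n-1}(n-1)!\,b^n}{(a + bx)^n}.$$

(g) $$D^n(x^n \log_e x) = n!\,[\log_e x + (1 + \tfrac{1}{2} + \tfrac{1}{3} + \cdots + 1/n)].$$

(h) $$D^n \log_e(1 + x^2) = \frac{(-1)^{n-1}\,2(n-1)!\,\cos\left[n \sin^{-1}\dfrac{1}{\sqrt{1 + x^2}}\right]}{(1 + x^2)^{n/2}}.$$

(i) $$D^n \tan^{-1} x = \frac{(-1)^{n-1}(n-1)!\,\sin\left[n \sin^{-1}\dfrac{1}{\sqrt{1 + x^2}}\right]}{(1 + x^2)^{n/2}}.$$

(j) $$D^n\left(\frac{\alpha + \beta x}{(x - a)^2 + b^2}\right) = \frac{(-1)^n\,n!}{b\,\rho^{n+1}}\,[b\beta \cos(n+1)\theta + (\alpha + \beta a)\sin(n+1)\theta],$$

where

$$\rho = \sqrt{(x - a)^2 + b^2}, \qquad \theta = \tan^{-1}\frac{b}{x - a}.$$

For an extensive investigation of $n$th derivatives the reader is referred
to Steffensen's Interpolation, pp. 231-241.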





13. Errors in Determinants. When the elements in a determinant are
inexact numbers, due to rounding or otherwise, the value of the
determinant may be seriously affected by the loss of the most important
significant figures in the expansion or evaluation process. The amount of
such losses cannot be determined in advance. We can, however, determine
the upper limit of the error in a determinant whose elements are subject
to given possible errors. For purposes of illustration we consider a
determinant of the third order.

Let

(1) $$D = \begin{vmatrix} x_1 & x_2 & x_3 \\ y_1 & y_2 & y_3 \\ z_1 & z_2 & z_3 \end{vmatrix}.$$

Now if the elements are subject to possible errors of unknown signs but of
magnitudes $\Delta x_1$, $\Delta y_1$, etc., which are small in comparison with $x_1$, $y_1$, etc.,
then the value of $D$ will be subject to the possible error $\Delta D$ such that

(2) $$D + \Delta D = \begin{vmatrix} x_1 + \Delta x_1 & x_2 + \Delta x_2 & x_3 + \Delta x_3 \\ y_1 + \Delta y_1 & y_2 + \Delta y_2 & y_3 + \Delta y_3 \\ z_1 + \Delta z_1 & z_2 + \Delta z_2 & z_3 + \Delta z_3 \end{vmatrix}.$$

By the addition theorem of determinants the right member of (2) can
be expressed as the sum of eight determinants, the first of which is the
original determinant $D$. Each of three of the remaining determinants
contains one column of error elements, each of three of the others contains
two columns of error elements, and the remaining determinant has three
columns of error elements. All determinants containing more than one
column of error elements will be neglected, because, when expanded, the
resulting terms will all contain second and third powers of the errors and
will therefore be negligible in comparison with terms containing only the
first powers of the errors. The value of $\Delta D$ is thus the sum of three
determinants each containing a single column of error elements.

But those determinants are only the differential of $D$, and we therefore
have

(3) $$dD = \begin{vmatrix} dx_1 & x_2 & x_3 \\ dy_1 & y_2 & y_3 \\ dz_1 & z_2 & z_3 \end{vmatrix} + \begin{vmatrix} x_1 & dx_2 & x_3 \\ y_1 & dy_2 & y_3 \\ z_1 & dz_2 & z_3 \end{vmatrix} + \begin{vmatrix} x_1 & x_2 & dx_3 \\ y_1 & y_2 & dy_3 \\ z_1 & z_2 & dz_3 \end{vmatrix},$$

or

(4) $$\begin{aligned} dD ={} & (y_2 z_3 - y_3 z_2)\,dx_1 - (x_2 z_3 - x_3 z_2)\,dy_1 + (x_2 y_3 - x_3 y_2)\,dz_1 \\ & - (y_1 z_3 - y_3 z_1)\,dx_2 + (x_1 z_3 - x_3 z_1)\,dy_2 - (x_1 y_3 - x_3 y_1)\,dz_2 \\ & + (y_1 z_2 - y_2 z_1)\,dx_3 - (x_1 z_2 - x_2 z_1)\,dy_3 + (x_1 y_2 - x_2 y_1)\,dz_3. \end{aligned}$$

The maximum possible error would occur when the signs of the elements 
and the signs of the errors were such that all the eighteen terms in the 
right member of (4) were of the same sign — a very remote possibility. 

Equation (4) shows that the error in a determinant composed of inexact 
elements may be anything from zero up to a number of considerable mag- 
nitude. It must be borne in mind, however, that the terms in (4) will 
largely cancel one another so that, in general, dD will not be large. 
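
The upper limit implied by (4) is the sum of the absolute values of the nine cofactors, each multiplied by the possible error of its element. The sketch below computes that limit for a third-order determinant; the function names and the sample matrix are illustrative assumptions, not data from the text.

```python
def minor_det(m, i, j):
    """2 x 2 determinant left after deleting row i and column j."""
    rows = [r for k, r in enumerate(m) if k != i]
    sub = [[v for l, v in enumerate(r) if l != j] for r in rows]
    return sub[0][0] * sub[1][1] - sub[0][1] * sub[1][0]

def cofactor(m, i, j):
    """Signed minor; these are the coefficients of the differentials in (4)."""
    return (-1) ** (i + j) * minor_det(m, i, j)

def det3(m):
    """Third-order determinant by expansion along the first row."""
    return sum(m[0][j] * cofactor(m, 0, j) for j in range(3))

def max_det_error(m, err):
    """First-order upper limit of |dD| when element (i, j) may err by err[i][j]."""
    return sum(abs(cofactor(m, i, j)) * err[i][j]
               for i in range(3) for j in range(3))

# Hypothetical data: every element rounded to one decimal, carrying 0.0005.
m = [[2.1, 3.4, 1.7],
     [4.2, 0.9, 2.8],
     [1.3, 5.6, 3.1]]
e = 0.0005
err = [[e] * 3 for _ in range(3)]
bound = max_det_error(m, err)

print(f"D = {det3(m):.4f}, possible error at most {bound:.5f}")
```

This limit is reached only when every error has the sign of its cofactor; with errors of random sign the actual change in the determinant is usually much smaller.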

14. A Final Remark. The present chapter may appropriately close 
with the following lines from Alexander Pope: 

A little learning is a dangerous thing; 

Drink deep, or taste not the Pierian spring: 

There shallow draughts intoxicate the brain, 

And drinking largely sobers us again. 

Pope was probably not thinking of approximate calculation when he 
wrote those lines, but no better advice could be given with respect to that 
subject. A smatter of knowledge of approximate calculation is worse than 
no knowledge at all. Fragmentary knowledge may lead to rough results 
that cannot be trusted. The author has seen students and teachers obtain 
far worse results from applying hazy ideas of the subject than if they had 
never heard of it. Their faulty work was due mostly to drastic rounding 
of numbers (at the beginning of a computation or at intermediate steps) 
or to dropping non-negligible terms in a series. 

The essence of this chapter cannot be given in one or two recitations,
nor in two or three. If the teacher has only two or three recitations to
devote to it, he had better leave it out entirely.


EXERCISES I 

1. Round off the following numbers correctly to four significant figures : 
63.8543, 93487, 0.0063945, 83615, 363042, 0.090038, 53908. 





2. A carpenter measures a 10-foot beam to the nearest eighth of an 
inch, and a machinist measures a ^-inch bolt to the nearest thousandth 
of an inch.* Which measurement is the more accurate? 


3. The following numbers are all approximate and are correct as far 
as their last digits only. Find their sum. 

136.421, 28.3, 321, 68.243, 17.482. 


4 . Find the sum of the following approximate numbers, each being 
correct only to the number of significant figures given: 

0.15625, 86.43, 191.6, 432.0 X 10, 930.42. 


5. The numbers 48.392 and 6852.4 are both approximate and true only 
to their last digits. Find their difference and state how many figures in 
the result are trustworthy. 


6. Find the value of $\sqrt{10} - \pi$ correct to five significant figures.


7. The theoretical horsepower available in a stream is given by the
formula

$$P = \frac{whQ}{550},$$

where $h$ = head in feet, $Q$ = discharge in cubic feet per second, and $w$ =
weight of a cubic foot of water. The weight of fresh water varies from
62.3 to 62.5 lbs. per cubic foot, depending upon its temperature and purity.

If the measured values of $Q$ and $h$ are $Q = 463$ cu. ft./sec. and
$h = 16.42$ ft., find the H. P. of the stream and indicate how many figures
of the result are reliable.


8. The velocity of water flowing in long pipes is given by the formula

$$v = \sqrt{\frac{2ghd}{fl}} \ \text{ft./sec.},$$

where $g$ = acceleration of gravity = 32.2 ft./sec.$^2$,
$h$ = head in feet,
$d$ = diameter of pipe in feet,
$l$ = length of pipe in feet,
$f$ = coefficient of pipe friction.

* When a measurement is recorded to the nearest unit, the absolute error of the 
measurement is not more than half a unit. 





In this problem the factor f is the most uncertain. It varies from 0.01 
to 0.05 and is usually somewhere between 0.02 and 0.03. Assuming that 
f is within the limits 0.02 and 0.03 and taking 

g = 32.2, 
h = 112 feet, 

d — ^ foot, (assumed exact) 

l = 1865 feet, 

find V and indicate its reliability. 

9. The velocity of water in a short pipe is given by the formula 

$$V = \sqrt{\frac{2gh}{1.5 + fl/d}},$$

where g, h, f, l, and d have the same meanings as in the preceding 
example. Taking l = 75 feet and the other data the same as in Ex. 8, 
find V and indicate its reliability. 


10. The acceleration of gravity at any point on the earth’s surface is 
given by the formula 

$$g = 32.1721 - 0.08211\cos 2L - 0.000003H,$$

where H = altitude in feet above sea level, and L = latitude of the place. 
It thus appears that the value of g is not 32, nor 32.2, nor even 32.17. 

Compute the kinetic energy of a 100-pound projectile moving with a 
velocity of 2000 feet per second by taking g equal to 32, 32.2, and 32.17 
in succession and note the extent to which the results disagree after the 
first two or three figures. 


11. How accurately should the length and time of vibration of a pen- 
dulum be measured in order that the computed value of g be correct to 
0.05 per cent? 


12. If in the formula 

$$R = \frac{r^2}{2h} + \frac{h}{2}$$

the percentage error in R is not to exceed 0.3 per cent, find the allowable 
percentage errors in r and h when r = 48 mm. and h = 56 mm. 

13. When the index of refraction of a liquid is determined by means 
of a refractometer, the index n is given by the formula 

$$n = \sqrt{N^2 - \sin^2\theta}.$$

If N = 1.62200 with an uncertainty of 0.00004 and $\theta$ = 38° approximately, 
find $\Delta\theta$ in order that n may be reliable to 0.02 per cent. 





14. The area of the cross section of a rod is desired to 0.2 per cent. 
How accurately should the diameter be measured? 

15. The approximate latitude of a place can be easily found by 
measuring the altitude h of Polaris at a known time t and using the 
formula 

$$L = h - p\cos t,$$

where p = polar distance = 90° - declination. 

Treating p as a constant and equal to 1°07'30", and taking h = 41°26', 
t = $0^h\,38^m\,42^s$, find the error in L due to errors of 1' in h and $5^s$ in t. 

16. In the preceding example find the allowable errors in h and t in 
order that the error in L shall not exceed 1', using the same values of p, 
t, and h as before. 

17. The distance between any two points $P_1$ and $P_2$ on the earth's 
surface is given by the formula 

$$\cos D = \sin L_1 \sin L_2 + \cos L_1 \cos L_2 \cos(\lambda_1 - \lambda_2),$$

where $L_1$, $L_2$ and $\lambda_1$, $\lambda_2$ denote the respective latitudes and longitudes of 
the two places. Find the allowable errors in $L_1$, $L_2$, $\lambda_1$, $\lambda_2$ in order that 
the error in D shall not exceed 1' (a geographical mile), taking 

$L_1$ = 36°10' N, $L_2$ = 58°43' N, $\lambda_1$ = 82°15' W, $\lambda_2$ = 126°42' W. 

18. The fundamental equations of practical astronomy are: 

$$\sin h = \sin\delta \sin L + \cos\delta \cos L \cos t, \tag{1}$$

$$\cos h \cos A = -\sin\delta \cos L + \cos\delta \sin L \cos t, \tag{2}$$

$$\cos h \sin A = \cos\delta \sin t, \tag{3}$$

where $\delta$ denotes declination, t hour angle, h altitude, and A azimuth of 
a celestial body and L denotes the latitude of a place on the earth. The 
declination $\delta$ is always accurately known and may therefore be considered 
free from error. 

Differentiating (1) by considering $\delta$ constant and h, L, t as variables, 
we have 

$$\cos h\,dh = (\sin\delta \cos L - \cos\delta \sin L \cos t)\,dL - \cos\delta \cos L \sin t\,dt.$$

Replacing $\cos\delta \sin L \cos t$ and $\cos\delta \sin t$ on the right by their values from 
(2) and (3), respectively, we get 

$$dh = -(\cos A\,dL + \sin A \cos L\,dt).$$

Solving for dL, 

$$dL = -(\sec A\,dh + \tan A \cos L\,dt). \tag{4}$$

This equation shows that the numerical value of dL is least when A is 
near 0° or 180°, that is, when the body is near the meridian. If A should 
be near 90°, that is, if the body should be near the prime vertical, the 
error in L might be enormous. Hence when determining latitude the 
observed body should be as near the meridian as possible. 

Using equation (4), compute dL when dh = 1', dt = $10^s$, L = 40°, 
A = 10°, and A = 80°. 

19. Using the formula $dL = -(\sec A\,dh + \tan A \cos L\,dt)$, find the 
allowable errors in t and h in order that the error in L may not exceed 1' 
when L = 40° and (a) A = 10° and (b) A = 76°. 

20. From the relation 

$$\cos h\,dh = (\sin\delta \cos L - \cos\delta \sin L \cos t)\,dL - \cos\delta \cos L \sin t\,dt$$

we find by means of (2) and (3) of Ex. 18 

$$dt = -\frac{dh + \cos A\,dL}{\sin A \cos L}.$$

This equation shows that dt is least numerically when A is near 90°, that 
is, when the observed body is near the prime vertical; it also shows that 
when the body is on or near the prime vertical an error in the assumed 
latitude has practically no effect on the error in t. 

Compute dt when dh = 1', dL = 5', L = 40°, A = 10°, and A = 80°. 

21. Using the formula for dt in the preceding example, find the allow- 
able errors in L and h in order that dt may not exceed $3^s$, taking L = 40°, 
A = 10°, and A = 80°. 

22. Using the formula of Ex. 20, take dt = $3^s$, dh = 1', and find dL 
for A = 10° and A = 80°. 

23. In the equation 

$$x = a\sin(kt + \alpha)$$

suppose a, k, and $\alpha$ are subject to the errors $\Delta a$, $\Delta k$, $\Delta\alpha$, respectively. 
Compute $\Delta x$ and see which of the errors $\Delta a$, $\Delta k$, $\Delta\alpha$ is the most potent 
in causing an error in x. 

24. Find the value of 

$$I = \int \frac{\sin x}{x}\,dx$$

correct to five decimal places. 





25. Compute the value of the integral 


$$\int \sqrt{1 - 0.162\sin^2\varphi}\;d\varphi$$


correct to five significant figures by first expanding the integrand by the 
binomial theorem and then integrating the result term by term. 


26. In the formula 

onuk 6ini{ht — h,)coai(h, + k,) 

8inJ(fi — f.)cosfc, 

k denotes an angle; hi, h 2 , h,, U, U arc all positive; (A, — ht) and (f| — <«) 
are small quantities; and {hi — h^) is small in comparison with ht. Find 
the maximum error in k due to errors in t and h, assuming that 

I dA. I - I dA. I I dh, I and | df.l - 1 df, |. 

27. Using the result found in Ex. 26, find the maximum value 

of dk when dl^O'M, dA—0'.05, A, — 40“, A, — 40“ 16', A, — 40“ 30', 
U — t, 4«. 



CHAPTER II 


INTERPOLATION 

DIFFERENCES. NEWTON’S FORMULAS OF INTERPOLATION 

15. Introduction. Interpolation has been defined as the art of reading 
between the lines of a table, and in elementary mathematics the term usually 
denotes the process of computing intermediate values of a function from 
a set of given or tabular values of that function. The general problem 
of interpolation, however, is much larger than this. In higher mathematics 
we frequently have to deal with functions whose analytical form is either 
totally unknown or else is of such a nature (complicated or otherwise) that 
the function can not easily be subjected to such operations as may be 
required. In either case it is desirable to replace the given function by 
another which can be more readily handled. This operation of replacing 
or representing a given function by a simpler one constitutes interpolation 
in the broad sense of the term.* 

The general problem of interpolation consists, then, in representing a 
function, known or unknown, in a form chosen in advance, with the aid 
of given values which this function takes for definite values of the inde- 
pendent variable. 

Thus, let $y = f(x)$ be a function given by the values $y_0, y_1, y_2, \dots, y_n$ 
which it takes for the values $x_0, x_1, x_2, \dots, x_n$ of the independent variable x, 
and let $\phi(x)$ denote an arbitrary simpler function so constructed that it 
takes the same values as $f(x)$ for the values $x_0, x_1, x_2, \dots, x_n$. Then if 
$f(x)$ is replaced by $\phi(x)$ over a given interval, the process constitutes 
interpolation, and the function $\phi(x)$ is a formula of interpolation. 

The function $\phi(x)$ can take a variety of forms. When $\phi(x)$ is a poly- 
nomial, the process of representing $f(x)$ by $\phi(x)$ is called parabolic or 
polynomial interpolation; and when $\phi(x)$ is a finite trigonometric series, 
the process is trigonometric interpolation. In like manner, $\phi(x)$ may be 
a series of exponential functions, Legendre polynomials, Bessel functions, 
etc. In practical problems we always choose for $\phi(x)$ the simplest func- 
tion which will represent the given function over the interval in question. 
Since polynomials are the simplest functions, we usually take a polynomial 
for $\phi(x)$, and nearly all the standard formulas of interpolation are 
polynomial formulas.* In case the given function is known to be periodic, 
however, it is better to represent it by a trigonometric series. 

* The author thinks that this process of replacing a complicated function by a 
simpler one should be called the principle of analytical replacement or the principle 
of functional substitution. 

The justification for replacing a given function by a polynomial or by a 
trigonometric series rests on two theorems proved by Weierstrass * in 1885. 
These theorems may be stated as follows: 

I. Every function which is continuous in an interval (a, b) can be repre- 
sented in that interval, to any desired degree of accuracy, by a polynomial; 
that is, it is possible to find a polynomial $F(x)$ such that $|f(x) - F(x)| < \epsilon$ 
for every value of x in the interval (a, b), where $\epsilon$ is any preassigned 
positive quantity. 

II. Every continuous function of period $2\pi$ can be represented by a finite 
trigonometric series of the form 

$$g(x) = a_0 + a_1 \sin x + a_2 \sin 2x + \cdots + a_n \sin nx + b_1 \cos x + b_2 \cos 2x + \cdots + b_n \cos nx;$$

or $|f(x) - g(x)| < \delta$ for all values of x in the interval considered, where 
$\delta$ represents any preassigned positive quantity. 



Geometrically these theorems mean that, having drawn the graphs of 
$y = f(x)$, $y = f(x) + \epsilon$, and $y = f(x) - \epsilon$, it is possible to find a polynomial 
or a finite trigonometric series whose graph remains within the region 
bounded by $y = f(x) + \epsilon$ and $y = f(x) - \epsilon$ for all values of x between a 
and b, however small $\epsilon$ may be. (See Fig. 1.) These theorems mean, 
therefore, that the given function may be replaced by a polynomial or by a 
finite trigonometric series to any desired degree of accuracy. 

* Über die analytische Darstellbarkeit sogenannter willkürlicher Funktionen einer 
reellen Veränderlichen (Sitzungsberichte der Kgl. Ak. der Wiss., 1885). 




16. Differences. If $y_0, y_1, y_2, \dots, y_n$ denote a set of values of any 
function $y = f(x)$, then $y_1 - y_0$, $y_2 - y_1$, $y_3 - y_2$, ..., $y_n - y_{n-1}$ are called the 
first differences of the function y. Denoting these differences by $\Delta y_0$, 
$\Delta y_1$, $\Delta y_2$, etc., we have $\Delta y_0 = y_1 - y_0$, $\Delta y_1 = y_2 - y_1$, ..., $\Delta y_{n-1} = y_n - y_{n-1}$, 
$\Delta y_n = y_{n+1} - y_n$. 

The differences of these first differences are called second differences. 
Denoting them by $\Delta^2 y_0$, $\Delta^2 y_1$, etc., we have 

$$\Delta^2 y_0 = \Delta y_1 - \Delta y_0 = y_2 - 2y_1 + y_0,$$

$$\Delta^2 y_1 = \Delta y_2 - \Delta y_1 = y_3 - 2y_2 + y_1,$$

etc. 

In like manner, the third differences are 

$$\Delta^3 y_0 = \Delta^2 y_1 - \Delta^2 y_0 = y_3 - 3y_2 + 3y_1 - y_0,$$

$$\Delta^3 y_1 = \Delta^2 y_2 - \Delta^2 y_1 = y_4 - 3y_3 + 3y_2 - y_1,$$

etc. 
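The repeated subtraction just described is easy to mechanize. The following is a minimal sketch in Python, not part of the original text; the function name `difference_table` and the sample values (cubes of 1 to 5) are chosen here purely for illustration.

```python
def difference_table(ys):
    """Return the columns [y, Δy, Δ²y, ...] of forward differences."""
    columns = [list(ys)]
    while len(columns[-1]) > 1:
        previous = columns[-1]
        # Each entry is the next value minus the current one.
        columns.append([b - a for a, b in zip(previous, previous[1:])])
    return columns


# Illustrative data (an assumption, not from the text): y = x^3 at x = 1, ..., 5.
ys = [1, 8, 27, 64, 125]
table = difference_table(ys)
for order, column in enumerate(table):
    print(order, column)

# The expanded forms given above agree with the repeated subtraction.
second_from_formula = ys[2] - 2 * ys[1] + ys[0]
third_from_formula = ys[3] - 3 * ys[2] + 3 * ys[1] - ys[0]
```

Column m of the result holds the differences of order m, so `table[2][0]` is $\Delta^2 y_0$ and `table[3][0]` is $\Delta^3 y_0$.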

The following difference table shows how the differences of all orders 
are formed: 

 x     y      Δy     Δ²y    Δ³y    Δ⁴y    Δ⁵y    Δ⁶y    Δ⁷y    Δ⁸y
 x₀    y₀
              Δy₀
 x₁    y₁            Δ²y₀
              Δy₁           Δ³y₀
 x₂    y₂            Δ²y₁          Δ⁴y₀
              Δy₂           Δ³y₁          Δ⁵y₀
 x₃    y₃            Δ²y₂          Δ⁴y₁          Δ⁶y₀
              Δy₃           Δ³y₂          Δ⁵y₁          Δ⁷y₀
 x₄    y₄            Δ²y₃          Δ⁴y₂          Δ⁶y₁          Δ⁸y₀
              Δy₄           Δ³y₃          Δ⁵y₂          Δ⁷y₁
 x₅    y₅            Δ²y₄          Δ⁴y₃          Δ⁶y₂
              Δy₅           Δ³y₄          Δ⁵y₃
 x₆    y₆            Δ²y₅          Δ⁴y₄
              Δy₆           Δ³y₅
 x₇    y₇            Δ²y₆
              Δy₇
 x₈    y₈

Table 1. Diagonal Difference Table. 

This table is called a diagonal difference table. The majority of dif- 
ference tables are of this kind, but for many purposes a more compact 
table, called a horizontal difference table, is preferable. In the horizontal 
difference tables the differences of different order are denoted by subscripts 
instead of exponents. Using the notation for horizontal differences, we can 
rewrite the preceding difference table in the horizontal form as follows: 


 x     y     Δ₁y     Δ₂y     Δ₃y     Δ₄y     Δ₅y     Δ₆y     Δ₇y     Δ₈y
 x₀    y₀
 x₁    y₁    Δ₁y₁
 x₂    y₂    Δ₁y₂    Δ₂y₂
 x₃    y₃    Δ₁y₃    Δ₂y₃    Δ₃y₃
 x₄    y₄    Δ₁y₄    Δ₂y₄    Δ₃y₄    Δ₄y₄
 x₅    y₅    Δ₁y₅    Δ₂y₅    Δ₃y₅    Δ₄y₅    Δ₅y₅
 x₆    y₆    Δ₁y₆    Δ₂y₆    Δ₃y₆    Δ₄y₆    Δ₅y₆    Δ₆y₆
 x₇    y₇    Δ₁y₇    Δ₂y₇    Δ₃y₇    Δ₄y₇    Δ₅y₇    Δ₆y₇    Δ₇y₇
 x₈    y₈    Δ₁y₈    Δ₂y₈    Δ₃y₈    Δ₄y₈    Δ₅y₈    Δ₆y₈    Δ₇y₈    Δ₈y₈

Table 2. Horizontal Difference Table. 


In order to see the relation between horizontal and diagonal differences 
of the same order, we give in Tables 3 and 4 the differences of both kinds 
in terms of the y’s. 

Inspection of these tables shows that the top diagonal line is the same 
in both, but that the bottom upwardly inclined diagonal in Table 3 is the 
same as the bottom horizontal line in Table 4. Also, from Table 3 we have, 
for example, 

$$\Delta^3 y_1 = y_4 - 3y_3 + 3y_2 - y_1.$$

Likewise, from Table 4 we have 

$$\Delta_3 y_4 = y_4 - 3y_3 + 3y_2 - y_1.$$

Hence 

$$\Delta^3 y_1 = \Delta_3 y_4.$$

A glance at Tables 3 and 4 will show that the general relation between 
the $\Delta$'s affected with exponents and those affected with subscripts is 

$$\Delta^m y_k = \Delta_m y_{k+m} \quad \text{(going forward from } y_k\text{)},$$

or 

$$\Delta_m y_n = \Delta^m y_{n-m} \quad \text{(going backward from } y_n\text{)},$$

where m denotes the order of differences and k and n the number of the 
tabulated value. 
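The relation between the two notations can be checked numerically. The sketch below is an illustration added here, not the author's procedure: it computes diagonal and horizontal differences by two independent recursions and compares them on the sinh values tabulated in this article, written as integers in units of the fifth decimal place. The function names `diagonal` and `horizontal` are hypothetical.

```python
def diagonal(ys, m, k):
    """Δ^m y_k, built forward from y_k: Δ^m y_k = Δ^(m-1) y_(k+1) - Δ^(m-1) y_k."""
    if m == 0:
        return ys[k]
    return diagonal(ys, m - 1, k + 1) - diagonal(ys, m - 1, k)


def horizontal(ys, m, n):
    """Δ_m y_n, built backward from y_n: Δ_m y_n = Δ_(m-1) y_n - Δ_(m-1) y_(n-1)."""
    if m == 0:
        return ys[n]
    return horizontal(ys, m - 1, n) - horizontal(ys, m - 1, n - 1)


# sinh x for x = 1.5, 1.6, ..., 2.1, in units of the fifth decimal place.
ys = [212928, 237557, 264563, 294217, 326816, 362686, 402186]

# Collect every pair (Δ^m y_k, Δ_m y_(k+m)) that the table contains.
pairs = [
    (diagonal(ys, m, k), horizontal(ys, m, k + m))
    for m in range(1, len(ys))
    for k in range(len(ys) - m)
]
print(all(d == h for d, h in pairs))
print(diagonal(ys, 3, 1), horizontal(ys, 3, 4))
```

The recursion is used only because it mirrors the definitions; for long tables an iterative column-by-column computation would be preferable.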

The relation between diagonal differences and horizontal differences 
can be illustrated still further by a numerical example. The tables that 
follow show both kinds of differences for a set of equidistant values of 
the function y = sinh x. 




[Table 3. Diagonal differences expressed in terms of the y's.] 

[Table 4. Horizontal Differences, expressed in terms of the y's.] 


  x       y        Δy      Δ²y     Δ³y     Δ⁴y     Δ⁵y     Δ⁶y
 1.5    2.12928
                  24629
 1.6    2.37557            2377
                  27006             271
 1.7    2.64563            2648              26
                  29654             297               3
 1.8    2.94217            2945              29               1
                  32599             326               4
 1.9    3.26816            3271              33
                  35870             359
 2.0    3.62686            3630
                  39500
 2.1    4.02186


  x       y        Δ₁y     Δ₂y     Δ₃y     Δ₄y     Δ₅y     Δ₆y
 1.5    2.12928
 1.6    2.37557   24629
 1.7    2.64563   27006    2377
 1.8    2.94217   29654    2648     271
 1.9    3.26816   32599    2945     297      26
 2.0    3.62686   35870    3271     326      29       3
 2.1    4.02186   39500    3630     359      33       4       1


It will be observed that the differences for the seven functional values 
are the same whether written as diagonal differences or as horizontal 
differences. 

In certain work, however, horizontal difference tables have distinct ad- 
vantages. In the numerical solution of differential equations, for example, 
the functional values behind us are always known, but those ahead of us 
are always unknown. Here the horizontal difference table shows all orders 
of differences on the same line as the last known value of the function, and 
these differences are used for finding the next computed value of the func- 
tion. The horizontal type of difference table is more compact and con- 
venient in this case than the diagonal type would be. 

On the other hand, when we take up the study of central-difference inter- 
polation in Chapter IV, we shall find that a diagonal difference table is 
much better for that purpose. 


































17. Effect of an Error in a Tabular Value. Let $y_0, y_1, y_2, \dots, y_n$ be 
the true values of a function, and suppose the value $y_5$ to be affected with 
an error $\epsilon$, so that its erroneous value is $y_5 + \epsilon$. Then the successive 
differences of the y's are as shown below: 

 y          Δy          Δ²y          Δ³y          Δ⁴y
 y₀
            Δy₀
 y₁                     Δ²y₀
            Δy₁                      Δ³y₀
 y₂                     Δ²y₁                      Δ⁴y₀
            Δy₂                      Δ³y₁
 y₃                     Δ²y₂                      Δ⁴y₁ + ε
            Δy₃                      Δ³y₂ + ε
 y₄                     Δ²y₃ + ε                  Δ⁴y₂ - 4ε
            Δy₄ + ε                  Δ³y₃ - 3ε
 y₅ + ε                 Δ²y₄ - 2ε                 Δ⁴y₃ + 6ε
            Δy₅ - ε                  Δ³y₄ + 3ε
 y₆                     Δ²y₅ + ε                  Δ⁴y₄ - 4ε
            Δy₆                      Δ³y₅ - ε
 y₇                     Δ²y₆                      Δ⁴y₅ + ε
            Δy₇                      Δ³y₆
 y₈                     Δ²y₇                      Δ⁴y₆
            Δy₈                      Δ³y₇
 y₉                     Δ²y₈
            Δy₉
 y₁₀

Table 5. Showing the effect of an error in the tabular values. 

This table shows that the effect of an error increases with the successive 
differences, that the coefficients of the $\epsilon$'s are the binomial coefficients with 
alternating signs, and that the algebraic sum of the errors in any difference 
column is zero. It shows also that the maximum error in the differences 
is in the same horizontal line as the erroneous tabular value. 

The following table shows the effect of an error in a horizontal difference 
table: 

 y          Δ₁y          Δ₂y          Δ₃y          Δ₄y
 y₀
 y₁         Δ₁y₁
 y₂         Δ₁y₂         Δ₂y₂
 y₃         Δ₁y₃         Δ₂y₃         Δ₃y₃
 y₄         Δ₁y₄         Δ₂y₄         Δ₃y₄         Δ₄y₄
 y₅ + ε     Δ₁y₅ + ε     Δ₂y₅ + ε     Δ₃y₅ + ε     Δ₄y₅ + ε
 y₆         Δ₁y₆ - ε     Δ₂y₆ - 2ε    Δ₃y₆ - 3ε    Δ₄y₆ - 4ε
 y₇         Δ₁y₇         Δ₂y₇ + ε     Δ₃y₇ + 3ε    Δ₄y₇ + 6ε
 y₈         Δ₁y₈         Δ₂y₈         Δ₃y₈ - ε     Δ₄y₈ - 4ε
 y₉         Δ₁y₉         Δ₂y₉         Δ₃y₉         Δ₄y₉ + ε
 y₁₀        Δ₁y₁₀        Δ₂y₁₀        Δ₃y₁₀        Δ₄y₁₀

Table 6. 


Here, again, the effect of the error is the same as in the preceding table, 
but in this table the first erroneous difference of any order is in the same 
horizontal line as the erroneous tabular value. 
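Since differencing is a linear operation, the error pattern of Tables 5 and 6 can be obtained by differencing a column that is zero everywhere except at the erroneous entry. The short sketch below, added here as an illustration, does this for eleven tabular values with the error in $y_5$ and $\epsilon$ taken as one unit; the names `forward_differences` and `error_pattern` are hypothetical.

```python
def forward_differences(values, order):
    """Differences of the given order of a list of equally spaced values."""
    column = list(values)
    for _ in range(order):
        column = [b - a for a, b in zip(column, column[1:])]
    return column


def error_pattern(count, bad_index, order, eps=1):
    """Errors induced in the differences of one order by a single bad entry."""
    errors = [0] * count
    errors[bad_index] = eps
    return forward_differences(errors, order)


# Eleven values y_0 ... y_10 with the error in y_5, as in Table 5.
patterns = {order: error_pattern(11, 5, order) for order in range(1, 5)}
for order, pattern in patterns.items():
    print(order, pattern, "sum =", sum(pattern))
```

Each printed row lists the multiples of $\epsilon$ that appear in one difference column, read from the top of the column downward.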

The law according to which an error is propagated in a difference table 
enables us to trace such an error to its source and correct it. As an illus- 
tration of the process of detecting and correcting an error in a tabulated 
function, let us consider the following table: 


  x        y        Δ₁y     Δ₂y     Δ₃y     Δ₄y      ε
 0.10   0.09983
 0.15   0.14944    4961
 0.20   0.19867    4923     -38
 0.25   0.24740    4873     -50     -12
 0.30   0.29552    4812     -61     -11       1
 0.35   0.34290    4738     -74     -13      -2
 0.40   0.38945    4655     -83      -9       4      ε
 0.45   0.43497    4552    -103     -20     -11    -4ε
 0.50   0.47943    4446    -106      -3      17     6ε
 0.55   0.52269    4326    -120     -14     -11    -4ε
 0.60   0.56464    4195    -131     -11       3      ε
 0.65   0.60519    4055    -140      -9       2
 0.70   0.64422    3903    -152     -12      -3



Here the third differences are quite irregular near the middle of the 
column, and the fourth differences are still more irregular. The irregularity 
begins in each column on the horizontal line corresponding to x = 0.40. 

Since the algebraic sum of the fourth differences is 0, the fourth dif- 
ferences found in this example are accumulated errors. Referring now to 
Table 6, we have 

$$-4\epsilon = -11, \qquad 6\epsilon = 17, \quad \text{etc.}$$

Hence $\epsilon = 3$ to the nearest unit. The true value of y corresponding to 
x = 0.40 is therefore 0.38945 - 0.00003 = 0.38942, since $(y_k + \epsilon) - \epsilon = y_k$. 
The columns of differences can now be corrected, and it will be found that 
the third differences are practically constant. 

If an error is present in the data, the differences of some order will 
become alternating in sign. Carry the differencing far enough to allow 
the error to be revealed in this manner. 
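The detection just carried out by inspection can be sketched in a few lines of Python. This is an illustration added here under two stated assumptions: that only one tabular value is in error, and that the true fourth differences are negligible, so that the largest fourth difference is approximately $6\epsilon$. The tabular values are entered as integers in units of the fifth decimal place.

```python
def forward_differences(values, order):
    """Differences of the given order of a list of equally spaced values."""
    column = list(values)
    for _ in range(order):
        column = [b - a for a, b in zip(column, column[1:])]
    return column


# y for x = 0.10, 0.15, ..., 0.70, in units of the fifth decimal place.
ys = [9983, 14944, 19867, 24740, 29552, 34290, 38945,
      43497, 47943, 52269, 56464, 60519, 64422]

fourth = forward_differences(ys, 4)

# The largest fourth difference is taken to be 6*eps (single-error assumption).
peak = max(range(len(fourth)), key=lambda i: abs(fourth[i]))
eps = round(fourth[peak] / 6)

# In diagonal numbering, Δ⁴y_j carries 6*eps when the faulty value is y_(j+2).
bad_index = peak + 2

corrected = list(ys)
corrected[bad_index] -= eps

print("fourth differences:", fourth)
print("faulty entry index:", bad_index, "eps:", eps)
print("corrected third differences:", forward_differences(corrected, 3))
```

With several erroneous entries the single-peak rule used here would no longer be reliable, in line with the remark that follows about separate errors.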


* Note. When writing numerical difference tables, or when substituting numerical 
differences in formulas, it is customary to omit the zeros between the decimal point 
and the first significant figure to the right of it; in other words, the differences are 
expressed in units of the last figure retained. Thus, instead of writing -0.00038 as 
the first number in the column Δ₂y we write simply -38. This practice will be 
followed throughout this book, except in a few instances where the zeros are written 
for the sake of clearness. 





If several tabular values of the function are affected with errors the 
successive differences of the function will become irregular, but it is not 
an easy matter to determine the sources and magnitudes of the separate 
errors. 

In the case where each of the tabulated y's is affected with an error 
of magnitude $\epsilon$ (say $\epsilon_k$ for $y_k$), each of the third differences is affected 
with an error $\epsilon_k - 3\epsilon_{k-1} + 3\epsilon_{k-2} - \epsilon_{k-3}$, each of the fourth differences with 
an error $\epsilon_k - 4\epsilon_{k-1} + 6\epsilon_{k-2} - 4\epsilon_{k-3} + \epsilon_{k-4}$, etc., as is evident from Tables 3 and 4. 

In practical problems the tabulated values of the function y are obtained 
by measurement or by computation. They are thus liable to be affected with 
errors of measurement or with errors due to rounding off the computed 
results to the given number of figures. In either case these errors would 
be magnified in the process of taking differences and they alone would be 
sufficient to cause the higher differences to become irregular.* For this 
reason it is usually not advisable to use differences higher than the fourth. 
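The extent of this magnification can be made definite. If every tabular value may be in error by as much as $\epsilon$, the error in a difference of order m is a sum of the $\epsilon_k$ with binomial coefficients of alternating sign, and it is largest when the individual errors alternate in sign; the bound is then $\epsilon$ times the sum of the binomial coefficients, that is, $2^m \epsilon$. The sketch below, added here for illustration with the hypothetical name `worst_case_error`, tabulates this bound for rounding errors of half a unit in the last figure.

```python
from math import comb


def worst_case_error(order, eps=0.5):
    """Upper bound on the error of a difference of the given order when each
    tabular value may be off by as much as eps (units of the last figure)."""
    return eps * sum(comb(order, j) for j in range(order + 1))


bounds = {order: worst_case_error(order) for order in range(1, 7)}
for order, bound in bounds.items():
    print(order, bound)
```

This is a bound and not an estimate of the typical error, which is ordinarily a good deal smaller.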

18. Relation Between Differences and Derivatives. It is sometimes 
desirable to know the relations which exist between differences and deriva- 
tives. The fundamental relations between them are: 

$$\Delta^n f(x) = (\Delta x)^n f^{(n)}(x + \theta n \Delta x), \qquad 0 < \theta < 1, \tag{18.1}$$

and 

$$\lim_{\Delta x \to 0} \frac{\Delta^n f(x)}{(\Delta x)^n} = f^{(n)}(x).$$

These relations are derived in de la Vallée-Poussin's Cours d'Analyse 
Infinitésimale, I (fourth edition, 1921), pp. 72-73. 
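Both relations can be observed numerically. The sketch below is an illustration added here; the choice of $f(x) = e^x$, the point x = 1, and the order n = 3 are assumptions made for convenience, because every derivative of $e^x$ is again $e^x$ and increases with x, so that (18.1) places the quotient $\Delta^n f(x)/(\Delta x)^n$ between $e^x$ and $e^{x + n\Delta x}$. The function name `nth_difference` is hypothetical.

```python
from math import comb, exp


def nth_difference(f, x, h, n):
    """Δ^n f(x) for interval h, written out with binomial coefficients."""
    return sum((-1) ** (n - j) * comb(n, j) * f(x + j * h) for j in range(n + 1))


x, n = 1.0, 3
ratios = {}
for h in (0.1, 0.01, 0.001):
    ratios[h] = nth_difference(exp, x, h, n) / h ** n
    # Quotient, the limiting value f'''(x), and the upper bound from (18.1).
    print(h, ratios[h], exp(x), exp(x + n * h))
```

For much smaller intervals the subtraction of nearly equal function values would spoil the quotient, which is the loss of significant figures discussed in Chapter I.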

19. Differences of a Polynomial. Let us now compute the successive 
differences of a polynomial of the nth degree. We have 

$$y = f(x) = ax^n + bx^{n-1} + cx^{n-2} + \cdots + kx + l. \tag{1}$$

$$\therefore\; y + \Delta y = a(x+h)^n + b(x+h)^{n-1} + c(x+h)^{n-2} + \cdots + k(x+h) + l, \tag{2}$$

where $h = \Delta x$. 

Subtracting (1) from (2), we get 

$$\Delta y = a[(x+h)^n - x^n] + b[(x+h)^{n-1} - x^{n-1}] + c[(x+h)^{n-2} - x^{n-2}] + \cdots + kh.$$


* For an exhaustive discussion of errors in the tabular values of a function, see 
Rice's Theory and Practice of Interpolation, pp. 7-16 and 40-02. Also O. Biermann's 
Vorlesungen über Mathematische Näherungsmethoden, p. 130. 





Expanding the quantities $(x+h)^n$, $(x+h)^{n-1}$, etc. by the binomial 
theorem, we have 

$$\begin{aligned}
\Delta y ={}& a\left[x^n + nhx^{n-1} + \frac{n(n-1)}{2}h^2x^{n-2} + \cdots - x^n\right] \\
&+ b\left[x^{n-1} + (n-1)hx^{n-2} + \frac{(n-1)(n-2)}{2}h^2x^{n-3} + \cdots - x^{n-1}\right] \\
&+ c\left[x^{n-2} + (n-2)hx^{n-3} + \cdots - x^{n-2}\right] + \cdots + kh,
\end{aligned}$$

or 

$$\begin{aligned}
\Delta y ={}& anhx^{n-1} + \left[a\,\frac{n(n-1)}{2}h^2 + b(n-1)h\right]x^{n-2} \\
&+ \left[a\,\frac{n(n-1)(n-2)}{3!}h^3 + b\,\frac{(n-1)(n-2)}{2}h^2 + c(n-2)h\right]x^{n-3} + \cdots.
\end{aligned}$$

Now if $\Delta x$ (= h) is constant, the bracketed coefficients of $x^{n-2}$, $x^{n-3}$, etc. 
are constants, so that we may replace them by the single constant coeffi- 
cients b', c', etc. Hence we have 

$$\Delta y = anhx^{n-1} + b'x^{n-2} + c'x^{n-3} + \cdots + k'x + l'. \tag{3}$$

The first difference of a polynomial of the nth degree is thus another poly- 
nomial of degree n - 1. 

To find the second difference we give x an increment $\Delta x = h$ in (3) 
and therefore have 

$$\Delta y + \Delta(\Delta y) = anh(x+h)^{n-1} + b'(x+h)^{n-2} + c'(x+h)^{n-3} + \cdots + k'(x+h) + l'. \tag{4}$$

Subtracting (3) from (4), we get 

$$\Delta(\Delta y) = \Delta^2 y = anh[(x+h)^{n-1} - x^{n-1}] + b'[(x+h)^{n-2} - x^{n-2}] + c'[(x+h)^{n-3} - x^{n-3}] + \cdots + k'h.$$

Expanding $(x+h)^{n-1}$, $(x+h)^{n-2}$, etc. by the binomial theorem and re- 
placing the constant coefficients of $x^{n-3}$, $x^{n-4}$, etc. by a single letter as 
before, we have 

$$\Delta^2 y = an(n-1)h^2x^{n-2} + b''x^{n-3} + \cdots + k''x + l''.$$

The second difference is thus a polynomial of degree n - 2. 





By continuing the calculation in this manner we arrive at a polynomial 
of zero degree for the nth difference; that is, 

$$\Delta^n y = a[n(n-1)(n-2)\cdots 1]h^n x^0 = a\,n!\,h^n.$$

The nth difference is therefore constant, and all higher differences are zero. 

The reader should bear in mind that this result is true only when h is a 
constant, that is, when the values of x are in arithmetic progression. 

The proposition which we have just proved may be stated as follows: 

The nth differences of a polynomial of the nth degree are constant when 
the values of the independent variable are taken in arithmetic progression, 
that is, at equal intervals apart. 

The converse of this proposition is also true, namely : 

If the nth differences of a tabulated function are constant when the values 
of the independent variable are taken in arithmetic progression, the function 
is a polynomial of degree n.* 

This second proposition enables us to replace any function by a poly- 
nomial if its differences of some order become constant or nearly so. Thus, 
the function tabulated in Art. 17 can be represented by a polynomial of 
the third degree, since the corrected third differences are approximately 
constant. 
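The first proposition, together with the value $a\,n!\,h^n$ of the constant difference, can be verified on a small example. In the sketch below, added here for illustration, the cubic $2x^3 - x + 5$, the starting value x = 1, and the interval h = 1/2 are arbitrary assumptions; exact fractions are used so that no rounding errors enter the differences.

```python
from fractions import Fraction
from math import factorial


def forward_differences(values, order):
    """Differences of the given order of a list of equally spaced values."""
    column = list(values)
    for _ in range(order):
        column = [b - a for a, b in zip(column, column[1:])]
    return column


a, n = 2, 3                      # leading coefficient and degree (assumed example)
h = Fraction(1, 2)               # constant interval
xs = [1 + k * h for k in range(8)]
ys = [a * x ** 3 - x + 5 for x in xs]

third = forward_differences(ys, n)
fourth = forward_differences(ys, n + 1)
expected = a * factorial(n) * h ** n

print("third differences:", third)
print("a * n! * h^n     :", expected)
print("fourth differences:", fourth)
```

If the abscissas were not equally spaced, the third differences of the same cubic would no longer be constant, as the remark on arithmetic progression requires.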

20. Newton's Formula for Forward Interpolation. Our next problem 
is to find suitable polynomials for replacing any given function over a 
given interval. Let $y = f(x)$ denote a function which takes the values 
$y_0, y_1, y_2, \dots, y_n$ for the equidistant values $x_0, x_1, x_2, \dots, x_n$ of the inde- 
pendent variable x, and let $\phi(x)$ denote a polynomial of the nth degree. 
This polynomial may be written in the form 

$$\begin{aligned}
\phi(x) ={}& a_0 + a_1(x - x_0) + a_2(x - x_0)(x - x_1) \\
&+ a_3(x - x_0)(x - x_1)(x - x_2) \\
&+ a_4(x - x_0)(x - x_1)(x - x_2)(x - x_3) \\
&+ \cdots + a_n(x - x_0)(x - x_1)(x - x_2)\cdots(x - x_{n-1}).
\end{aligned} \tag{1}$$

We shall now determine the coefficients $a_0, a_1, a_2, \dots, a_n$ so as to make 
$\phi(x_0) = y_0$, $\phi(x_1) = y_1$, $\phi(x_2) = y_2$, ..., $\phi(x_n) = y_n$. 

Substituting in (1) the successive values $x_0, x_1, x_2, \dots, x_n$ for x, at 
the same time putting $\phi(x_0) = y_0$, $\phi(x_1) = y_1$, etc., and remembering that 
$x_1 - x_0 = h$, $x_2 - x_0 = 2h$, etc., we have 

* For the proof of this proposition see Rice's Theory and Practice of Interpolation, 
p. 24. 





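$$y_0 = a_0, \quad \text{or} \quad a_0 = y_0.$$

$$y_1 = a_0 + a_1(x_1 - x_0) = y_0 + a_1 h.$$

$$\therefore\; a_1 = \frac{y_1 - y_0}{h} = \frac{\Delta y_0}{h}.$$

$$y_2 = a_0 + a_1(x_2 - x_0) + a_2(x_2 - x_0)(x_2 - x_1) = y_0 + \frac{y_1 - y_0}{h}(2h) + a_2(2h)(h).$$

$$\therefore\; a_2 = \frac{y_2 - 2y_1 + y_0}{2h^2} = \frac{\Delta^2 y_0}{2h^2}.$$

$$\begin{aligned}
y_3 &= a_0 + a_1(x_3 - x_0) + a_2(x_3 - x_0)(x_3 - x_1) + a_3(x_3 - x_0)(x_3 - x_1)(x_3 - x_2) \\
&= y_0 + \frac{y_1 - y_0}{h}(3h) + \frac{y_2 - 2y_1 + y_0}{2h^2}(3h)(2h) + a_3(3h)(2h)(h).
\end{aligned}$$

$$\therefore\; a_3 = \frac{y_3 - 3y_2 + 3y_1 - y_0}{6h^3} = \frac{\Delta^3 y_0}{3!\,h^3}.$$

$$\begin{aligned}
y_4 &= a_0 + a_1(x_4 - x_0) + a_2(x_4 - x_0)(x_4 - x_1) + a_3(x_4 - x_0)(x_4 - x_1)(x_4 - x_2) \\
&\quad + a_4(x_4 - x_0)(x_4 - x_1)(x_4 - x_2)(x_4 - x_3) \\
&= y_0 + \frac{y_1 - y_0}{h}(4h) + \frac{y_2 - 2y_1 + y_0}{2h^2}(4h)(3h) \\
&\quad + \frac{y_3 - 3y_2 + 3y_1 - y_0}{6h^3}(4h)(3h)(2h) + a_4(4h)(3h)(2h)(h).
\end{aligned}$$

$$\therefore\; a_4 = \frac{y_4 - 4y_3 + 6y_2 - 4y_1 + y_0}{4!\,h^4} = \frac{\Delta^4 y_0}{4!\,h^4}.$$

By continuing this method of calculating the coefficients we shall find that 

$$a_5 = \frac{\Delta^5 y_0}{5!\,h^5}, \qquad a_6 = \frac{\Delta^6 y_0}{6!\,h^6}, \qquad \dots, \qquad a_n = \frac{\Delta^n y_0}{n!\,h^n}.$$

Substituting these values of $a_0, a_1, \dots, a_n$ in (1), we get 

$$\begin{aligned}
\phi(x) ={}& y_0 + \frac{\Delta y_0}{h}(x - x_0) + \frac{\Delta^2 y_0}{2h^2}(x - x_0)(x - x_1) \\
&+ \frac{\Delta^3 y_0}{3!\,h^3}(x - x_0)(x - x_1)(x - x_2) \\
&+ \frac{\Delta^4 y_0}{4!\,h^4}(x - x_0)(x - x_1)(x - x_2)(x - x_3) + \cdots \\
&+ \frac{\Delta^n y_0}{n!\,h^n}(x - x_0)(x - x_1)(x - x_2)\cdots(x - x_{n-1}).
\end{aligned} \tag{2}$$

This is Newton's formula for forward interpolation, written in terms of x. 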





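The formula can be simplified by a change of variable. Let us first write 
(2) in the following equivalent form: 

$$\begin{aligned}
\phi(x) ={}& y_0 + \Delta y_0\left(\frac{x - x_0}{h}\right) + \frac{\Delta^2 y_0}{2}\left(\frac{x - x_0}{h}\right)\left(\frac{x - x_1}{h}\right) \\
&+ \frac{\Delta^3 y_0}{3!}\left(\frac{x - x_0}{h}\right)\left(\frac{x - x_1}{h}\right)\left(\frac{x - x_2}{h}\right) \\
&+ \frac{\Delta^4 y_0}{4!}\left(\frac{x - x_0}{h}\right)\left(\frac{x - x_1}{h}\right)\left(\frac{x - x_2}{h}\right)\left(\frac{x - x_3}{h}\right) + \cdots \\
&+ \frac{\Delta^n y_0}{n!}\left(\frac{x - x_0}{h}\right)\left(\frac{x - x_1}{h}\right)\cdots\left(\frac{x - x_{n-1}}{h}\right).
\end{aligned} \tag{3}$$

Now put 

$$\frac{x - x_0}{h} = u, \quad \text{or} \quad x = x_0 + hu.$$

Then since $x_1 = x_0 + h$, $x_2 = x_0 + 2h$, etc., we have 

$$\frac{x - x_1}{h} = \frac{x - (x_0 + h)}{h} = \frac{x - x_0}{h} - \frac{h}{h} = u - 1,$$

$$\frac{x - x_2}{h} = \frac{x - (x_0 + 2h)}{h} = \frac{x - x_0}{h} - \frac{2h}{h} = u - 2,$$

$$\frac{x - x_{n-1}}{h} = \frac{x - [x_0 + (n-1)h]}{h} = \frac{x - x_0}{h} - \frac{(n-1)h}{h} = u - (n - 1) = u - n + 1.$$

Substituting in (3) these values of $(x - x_0)/h$, $(x - x_1)/h$, etc., we get 

$$\begin{aligned}
\phi(x) = \phi(x_0 + hu) = \psi(u) ={}& y_0 + u\,\Delta y_0 + \frac{u(u-1)}{2}\Delta^2 y_0 + \frac{u(u-1)(u-2)}{3!}\Delta^3 y_0 \\
&+ \frac{u(u-1)(u-2)(u-3)}{4!}\Delta^4 y_0 + \cdots \\
&+ \frac{u(u-1)(u-2)\cdots(u-n+1)}{n!}\Delta^n y_0.
\end{aligned} \tag{I}$$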


This is the form in which Newton's formula for forward interpolation 
is usually written. We shall refer to it hereafter as Newton's formula 
(I).* It will be observed that the coefficients of the $\Delta$'s are the binomial 
coefficients. 


* Recent historical investigation has shown that this formula was really first dis- 
covered by James Gregory as early as 1670. 





The reason for the name "forward" interpolation formula lies in the 
fact that the formula contains values of the tabulated function from $y_0$ 
onward to the right (forward from $y_0$) and none to the left of this value. 
Because of this fact this formula is used mainly for interpolating the values 
of y near the beginning of a set of tabular values and for extrapolating 
values of y a short distance backward (to the left) from $y_0$. 

The starting point $y_0$ may be any tabular value, but then the formula 
will contain only those values of y which come after the value chosen as 
starting point. 
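Formula (I) translates directly into a short routine. The following Python sketch is an illustration added here and not the author's computing schedule; the name `newton_forward` and its argument order are assumptions. It forms the leading diagonal $y_0, \Delta y_0, \Delta^2 y_0, \dots$ and builds each binomial coefficient from the preceding one. The sinh values of Art. 16 serve as data.

```python
def newton_forward(x0, h, ys, x):
    """Evaluate Newton's formula (I) at x from equally spaced values ys,
    the first of which belongs to x0; h is the tabular interval."""
    u = (x - x0) / h

    # Leading diagonal of the difference table: y_0, Δy_0, Δ²y_0, ...
    leading = []
    column = list(ys)
    while column:
        leading.append(column[0])
        column = [b - a for a, b in zip(column, column[1:])]

    total, coefficient = 0.0, 1.0
    for m, difference in enumerate(leading):
        total += coefficient * difference
        # u(u-1)...(u-m) / (m+1)!  is obtained from the preceding coefficient.
        coefficient *= (u - m) / (m + 1)
    return total


# sinh x for x = 1.5, 1.6, ..., 2.1, from the table in Art. 16.
sinh_values = [2.12928, 2.37557, 2.64563, 2.94217, 3.26816, 3.62686, 4.02186]
estimate = newton_forward(1.5, 0.1, sinh_values, 1.55)
print(estimate)
```

All the tabulated differences are used here; in hand computation one would stop as soon as the terms cease to affect the last figure retained.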

21. Newton's Formula for Backward Interpolation. The formulas of 
the preceding section can not be used for interpolating a value of y near 
the end of the tabular values. To derive a formula for this case we write 
the polynomial $\phi(x)$ in the following form: 

$$\begin{aligned}
\phi(x) ={}& a_0 + a_1(x - x_n) + a_2(x - x_n)(x - x_{n-1}) \\
&+ a_3(x - x_n)(x - x_{n-1})(x - x_{n-2}) \\
&+ a_4(x - x_n)(x - x_{n-1})(x - x_{n-2})(x - x_{n-3}) + \cdots \\
&+ a_n(x - x_n)(x - x_{n-1})\cdots(x - x_1).
\end{aligned} \tag{1}$$

Then we determine the coefficients $a_0, a_1, \dots, a_n$ so as to make $\phi(x_n) = y_n$, 
$\phi(x_{n-1}) = y_{n-1}$, etc. Substituting in (1) the values $x_n$, $x_{n-1}$, etc. for x and 
at the same time putting $\phi(x_n) = y_n$, $\phi(x_{n-1}) = y_{n-1}$, etc., we have 

$$y_n = a_0, \quad \text{or} \quad a_0 = y_n.$$

$$y_{n-1} = a_0 + a_1(x_{n-1} - x_n) = y_n + a_1(-h).$$

$$\therefore\; a_1 = \frac{y_n - y_{n-1}}{h} = \frac{\Delta_1 y_n}{h}.$$

$$\begin{aligned}
y_{n-2} &= a_0 + a_1(x_{n-2} - x_n) + a_2(x_{n-2} - x_n)(x_{n-2} - x_{n-1}) \\
&= y_n + \frac{y_n - y_{n-1}}{h}(-2h) + a_2(-2h)(-h).
\end{aligned}$$

$$\therefore\; a_2 = \frac{y_n - 2y_{n-1} + y_{n-2}}{2h^2} = \frac{\Delta_2 y_n}{2h^2}.$$

By continuing the calculation of the coefficients in this manner we shall find 

$$a_3 = \frac{\Delta_3 y_n}{3!\,h^3}, \qquad a_4 = \frac{\Delta_4 y_n}{4!\,h^4}, \qquad \dots, \qquad a_n = \frac{\Delta_n y_n}{n!\,h^n}.$$

Substituting these values of $a_0$, $a_1$, $a_2$, etc. in (1), we have 




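$$\begin{aligned}
\phi(x) ={}& y_n + \frac{\Delta_1 y_n}{h}(x - x_n) + \frac{\Delta_2 y_n}{2h^2}(x - x_n)(x - x_{n-1}) \\
&+ \frac{\Delta_3 y_n}{3!\,h^3}(x - x_n)(x - x_{n-1})(x - x_{n-2}) \\
&+ \frac{\Delta_4 y_n}{4!\,h^4}(x - x_n)(x - x_{n-1})(x - x_{n-2})(x - x_{n-3}) + \cdots \\
&+ \frac{\Delta_n y_n}{n!\,h^n}(x - x_n)(x - x_{n-1})\cdots(x - x_1).
\end{aligned} \tag{2}$$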


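This is Newton's formula for backward interpolation, written in terms 
of x. It can be simplified by making a change of variable, as was done 
in Art. 20. 

Let us first write (2) in the equivalent form 

$$\begin{aligned}
\phi(x) ={}& y_n + \Delta_1 y_n\left(\frac{x - x_n}{h}\right) + \frac{\Delta_2 y_n}{2}\left(\frac{x - x_n}{h}\right)\left(\frac{x - x_{n-1}}{h}\right) \\
&+ \frac{\Delta_3 y_n}{3!}\left(\frac{x - x_n}{h}\right)\left(\frac{x - x_{n-1}}{h}\right)\left(\frac{x - x_{n-2}}{h}\right) \\
&+ \frac{\Delta_4 y_n}{4!}\left(\frac{x - x_n}{h}\right)\left(\frac{x - x_{n-1}}{h}\right)\left(\frac{x - x_{n-2}}{h}\right)\left(\frac{x - x_{n-3}}{h}\right) + \cdots
\end{aligned} \tag{3}$$

Now put 

$$u = \frac{x - x_n}{h}, \quad \text{or} \quad x = x_n + hu.$$

Then since $x_{n-1} = x_n - h$, $x_{n-2} = x_n - 2h$, etc., we have 

$$\frac{x - x_{n-1}}{h} = \frac{x - (x_n - h)}{h} = \frac{x - x_n}{h} + \frac{h}{h} = u + 1,$$

$$\frac{x - x_{n-2}}{h} = \frac{x - (x_n - 2h)}{h} = \frac{x - x_n}{h} + \frac{2h}{h} = u + 2,$$

$$\frac{x - x_1}{h} = \frac{x - [x_n - (n-1)h]}{h} = \frac{x - x_n}{h} + \frac{(n-1)h}{h} = u + n - 1.$$

Substituting in (3) these values of $(x - x_n)/h$, $(x - x_{n-1})/h$, etc., we get 

$$\begin{aligned}
\phi(x) = \phi(x_n + hu) = \psi(u) ={}& y_n + u\,\Delta_1 y_n + \frac{u(u+1)}{2}\Delta_2 y_n + \frac{u(u+1)(u+2)}{3!}\Delta_3 y_n \\
&+ \frac{u(u+1)(u+2)(u+3)}{4!}\Delta_4 y_n + \cdots \\
&+ \frac{u(u+1)(u+2)\cdots(u+n-1)}{n!}\Delta_n y_n.
\end{aligned} \tag{II}$$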






This is the form in which Newton's formula for backward interpola- 
tion is usually written. We shall refer to this formula hereafter as Newton's 
formula (II). It is to be observed that this formula employs horizontal 
differences, whereas the formula for forward interpolation employs diagonal 
differences. 

(II) is called the formula for "backward" interpolation because it con- 
tains values of the tabulated function from $y_n$ backward to the left and none 
to the right of $y_n$. This formula is used mainly for interpolating values 
of y near the end of a set of tabular values, and also for extrapolating values 
of y a short distance ahead (to the right) of $y_n$. 
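Formula (II) can be sketched in the same manner as formula (I). In the illustrative Python routine below (the name `newton_backward` and its arguments are assumptions made here), the horizontal differences on the line of $y_n$ are simply the last entries of the successive difference columns. The sinh values of Art. 16 again serve as data, this time for an argument near the end of the table.

```python
def newton_backward(xn, h, ys, x):
    """Evaluate Newton's formula (II) at x from equally spaced values ys,
    the last of which belongs to xn; h is the tabular interval."""
    u = (x - xn) / h

    # Horizontal differences on the line of y_n: y_n, Δ₁y_n, Δ₂y_n, ...
    trailing = []
    column = list(ys)
    while column:
        trailing.append(column[-1])
        column = [b - a for a, b in zip(column, column[1:])]

    total, coefficient = 0.0, 1.0
    for m, difference in enumerate(trailing):
        total += coefficient * difference
        # u(u+1)...(u+m) / (m+1)!  is obtained from the preceding coefficient.
        coefficient *= (u + m) / (m + 1)
    return total


# sinh x for x = 1.5, 1.6, ..., 2.1, from the table in Art. 16.
sinh_values = [2.12928, 2.37557, 2.64563, 2.94217, 3.26816, 3.62686, 4.02186]
estimate = newton_backward(2.1, 0.1, sinh_values, 2.05)
print(estimate)
```

Since u is negative for interpolation inside the table, the successive terms here alternate in their effect less regularly than in formula (I), and it is well to watch their magnitudes when deciding where to stop.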

We shall now illustrate the use of Newton’s formulas by working some 
examples. 

Example 1. Find $\log_{10}\pi$, having given

log 3.141 = 0.4970679364,
log 3.142 = 0.4972061807,
log 3.143 = 0.4973443810,
log 3.144 = 0.4974825374,
log 3.145 = 0.4976206498.


Solution. We first form the table of differences, as shown below (the differences are in units of the tenth decimal place):

   x       y = log x        Δy        Δ²y     Δ³y
 3.141   0.4970679364
                          1382443
 3.142   0.4972061807                -440
                          1382003               1
 3.143   0.4973443810                -439
                          1381564              -1
 3.144   0.4974825374                -440
                          1381124
 3.145   0.4976206498

Here $x = \pi = 3.1415926536$, $x_0 = 3.141$, $h = 0.001$. Hence

\[ u = \frac{x - x_0}{h} = \frac{3.1415926536 - 3.141}{0.001} = 0.5926536, \qquad u - 1 = -0.4073464, \text{ etc.} \]

Substituting these values in (I), Art. 20, we get

\[ \log_{10}\pi = 0.4970679364 + 0.5926536\,(0.0001382443) + \frac{0.5926536\,(-0.4073464)}{2}(-0.0000000440) \]
\[ \qquad = 0.4970679364 + 0.0000819310 + 0.0000000053 \]
\[ \qquad = 0.4971498727. \]

This result is correct to its last figure. 
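The arithmetic of Example 1 can be checked with a short sketch of Newton's forward formula. The name `newton_forward` is hypothetical; the routine assumes equal spacing and, unlike the hand computation, carries all available differences rather than stopping at the second.

```python
import math


def newton_forward(xs, ys, x):
    """Evaluate Newton's forward-difference formula at x (equal spacing assumed)."""
    h = xs[1] - xs[0]
    u = (x - xs[0]) / h
    column = list(ys)
    total, coeff, k = 0.0, 1.0, 0  # coeff holds u(u-1)...(u-k+1)/k!
    while column:
        total += coeff * column[0]           # column[0] is the k-th difference of y_0
        coeff *= (u - k) / (k + 1)
        column = [b - a for a, b in zip(column, column[1:])]
        k += 1
    return total


xs = [3.141, 3.142, 3.143, 3.144, 3.145]
ys = [0.4970679364, 0.4972061807, 0.4973443810, 0.4974825374, 0.4976206498]
log_pi = newton_forward(xs, ys, math.pi)
```

The value obtained agrees with the tabular result above to the accuracy of the ten-place data.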

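Example 2. Using the tabular values of the preceding example, find
$\log_{10} 3.140$.

Solution. Here $x = x_{-1} = 3.140$, $x_0 = 3.141$, $h = 0.001$. Hence

\[ u = \frac{x - x_0}{h} = \frac{x_{-1} - x_0}{h} = \frac{-h}{h} = -1, \qquad u - 1 = -2, \text{ etc.} \]

\[ \therefore\ \log_{10} 3.140 = 0.4970679364 + (-1)(0.0001382443) + \frac{(-1)(-2)}{2}(-0.0000000440) \]
\[ \qquad = 0.4970679364 - 0.0001382443 - 0.0000000440 \]
\[ \qquad = 0.4969296481. \]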

This result is also correct to its last figure. 

Note. The process of computing the value of a function outside the 
range of given values, as in the example above, is called extrapolation. 
It should be used with caution, but if the function is known to run 
smoothly near the ends of the range of given values, and if h is taken as 
small as it should be, we are usually safe in extrapolating for a distance 
h outside the range of given values. 

Example 3. The hourly declination of the moon for January 1, 1918,
is given in the following table. Find the declination at 3h 35m 15s.

 Hour   Declination        Δ₁            Δ₂       Δ₃
   0    8° 29' 53".7
   1    8  18  19 .4    -11' 34".3
   2    8   6  43 .5    -11  35 .9     -1".6
   3    7  55   6 .1    -11  37 .4     -1 .5     0".1
   4    7  43  27 .2    -11  38 .9     -1 .5     0 .0


Solution. Since the desired declination is near the end of the values
given we use Newton’s formula (II), and we therefore form a horizontal
difference table, as shown above. Denoting the time in hours by t, we
have $t_n = 4$, t = 3h 35m 15s, $h = 1$. Hence

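\[ u = \frac{t - t_n}{h} = -\frac{0^h\,24^m\,45^s}{1^h} = -\frac{1485^s}{3600^s} = -0.4125, \qquad u + 1 = 0.5875. \]

Substituting these values in (II) and denoting the required declination
by $\delta$, we get

\[ \delta = 7^\circ 43' 27''.2 + (-0.4125)(-11' 38''.9) + \frac{(-0.4125)(0.5875)}{2}(-1''.5) \]
\[ \quad = 7^\circ 43' 27''.2 + 4' 48''.3 + 0''.2 \]
\[ \quad = 7^\circ 48' 15''.7. \]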


Example 4. Using the data of the preceding problem, find the declination
of the moon at t = 5h.

Solution. Here $t = 5$, $t_n = 4$, $h = 1$.

\[ \therefore\ u = 1, \qquad u + 1 = 2. \]

Substituting in (II), we have

\[ \delta = 7^\circ 43' 27''.2 + (1)(-11' 38''.9) + \frac{(1)(2)}{2}(-1''.5) = 7^\circ 31' 46''.8. \]

The true value, as given in the American Ephemeris and Nautical Almanac,
is 7° 31' 46".9, the error in the extrapolated value thus being only 0".1.


EXERCISES II

1. Find and correct by means of differences the error in the following
table:

20736   28561   38416   50625   65540   83521   104976   130321   160000.


2. Correct the error in this table:

 19° 12' 22".4
 19  25  54 .7
 19  39   7 .3
 19  51  53 .8
 20   4  31 .9
 20  16  43 .6
 20  28  34 .3.


3. Find $\log_{10} \sin 37' 23''$, given

log sin 37' = 8.0319195 - 10
log sin 38' = 8.0435009 - 10
log sin 39' = 8.0547814 - 10
log sin 40' = 8.0657763 - 10
log sin 41' = 8.0764997 - 10
log sin 42' = 8.0869646 - 10
log sin 43' = 8.0971832 - 10.

4. The following table gives the longitude of the moon at twelve-hour
intervals for the first four days of April, 1918. Find the moon’s longitude
at 8:50 P. M. on April 2, the day beginning at noon.

 Apr. 1    0h    244° 44' 20".5
 Apr. 1   12     250  57  36 .7
 Apr. 2    0     257  14  22 .1
 Apr. 2   12     263  35   8 .6
 Apr. 3    0     270   0  24 .6
 Apr. 3   12     276  30  39 .6
 Apr. 4    0     283   6  22 .1.


5 . Using the data of Exercise 3, find log sin 42' 13". 

6. Using the data of Exercise 4, find the moon’s longitude at 8:43 
P. M., Apr. 3. 



CHAPTER III


INTERPOLATION WITH UNEQUAL INTERVALS 
OF THE ARGUMENT 

22. Divided Differences. The interpolation formulas derived in the
preceding chapter are applicable only when the values of the functions are
given at equidistant intervals of the independent variable, or argument.
It is sometimes inconvenient, or even impossible, to obtain values of a
function at equidistant values of its argument, and in such cases it is
desirable to have interpolation formulas which are applicable when the
functional values are given at unequal intervals of the argument. Two
such formulas are Newton’s formula for unequal intervals of the argument
and Lagrange’s formula. The former employs differences, but the latter
does not. The differences used in the Newton formula are called divided
differences, which are differences obtained in the usual manner and then
divided by certain differences of the argument. Hence the name.

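Let $y_0, y_1, y_2, \ldots, y_n$ denote functional values corresponding to any values
$x_0, x_1, x_2, \ldots, x_n$ of the argument. Then the divided differences of y in
ascending order are defined as follows:

First order divided differences:

\[ \delta(x_1, x_0) = \frac{y_1 - y_0}{x_1 - x_0}, \quad \delta(x_2, x_1) = \frac{y_2 - y_1}{x_2 - x_1}, \quad \delta(x_3, x_2) = \frac{y_3 - y_2}{x_3 - x_2}, \quad \text{etc.} \]

Second order differences:

\[ \delta(x_2, x_1, x_0) = \frac{\delta(x_2, x_1) - \delta(x_1, x_0)}{x_2 - x_0}, \quad \delta(x_3, x_2, x_1) = \frac{\delta(x_3, x_2) - \delta(x_2, x_1)}{x_3 - x_1}, \quad \text{etc.} \]

Third order differences:

\[ \delta(x_3, x_2, x_1, x_0) = \frac{\delta(x_3, x_2, x_1) - \delta(x_2, x_1, x_0)}{x_3 - x_0}, \quad \delta(x_4, x_3, x_2, x_1) = \frac{\delta(x_4, x_3, x_2) - \delta(x_3, x_2, x_1)}{x_4 - x_1}, \quad \text{etc.} \]

Fourth order differences:

\[ \delta(x_4, x_3, x_2, x_1, x_0) = \frac{\delta(x_4, x_3, x_2, x_1) - \delta(x_3, x_2, x_1, x_0)}{x_4 - x_0}, \quad \text{etc.} \]

Note that the order of any divided difference is one less than the number
of values of the argument in it.

If $(x_0, y_0)$, $(x_1, y_1)$, $(x_2, y_2)$, etc. are points on a curve, the first-order
divided difference is the slope of the secant line through any two points.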


23. Tables of Divided Differences. Tables of divided differences are 
shown below. 


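  x     y     δ            δ²              δ³                  δ⁴
  x_0   y_0
              δ(x_1,x_0)
  x_1   y_1                δ(x_2,x_1,x_0)
              δ(x_2,x_1)                   δ(x_3,x_2,x_1,x_0)
  x_2   y_2                δ(x_3,x_2,x_1)                      δ(x_4,x_3,x_2,x_1,x_0)
              δ(x_3,x_2)                   δ(x_4,x_3,x_2,x_1)
  x_3   y_3                δ(x_4,x_3,x_2)                      δ(x_5,x_4,x_3,x_2,x_1)
              δ(x_4,x_3)                   δ(x_5,x_4,x_3,x_2)
  x_4   y_4                δ(x_5,x_4,x_3)                      δ(x_6,x_5,x_4,x_3,x_2)
              δ(x_5,x_4)                   δ(x_6,x_5,x_4,x_3)
  x_5   y_5                δ(x_6,x_5,x_4)
              δ(x_6,x_5)
  x_6   y_6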

   x      y       δ      δ²     δ³    δ⁴    δ⁵
  -2    -792
                 450
   0     108            -102
                 -60             18
   3     -72              24           -3
                  60             -9          0.5
   5      48             -39            2
                 -96              7
   7    -144              -4
                -108
   8    -252

The second entry in the δ²-column above, for example, was found as
follows:

\[ \frac{60 - (-60)}{5 - 0} = 24. \]

For the first entry in the δ⁴-column we have

\[ \frac{-9 - 18}{7 - (-2)} = -3. \]

The other entries were found in a similar manner. 
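The construction of such a table is mechanical, and a minimal sketch may make the rule explicit. The function name `divided_difference_table` is hypothetical; exact rational arithmetic (Python's `fractions` module) is an illustrative choice so that the entries can be compared digit for digit with the numerical table of this article.

```python
from fractions import Fraction


def divided_difference_table(xs, ys):
    """Return columns where columns[k][i] is the k-th order divided
    difference formed from the arguments x_i, ..., x_{i+k}."""
    columns = [[Fraction(y) for y in ys]]
    for order in range(1, len(xs)):
        prev = columns[-1]
        columns.append([
            # Difference of adjacent entries, divided by the span of arguments involved.
            (prev[i + 1] - prev[i]) / (Fraction(xs[i + order]) - Fraction(xs[i]))
            for i in range(len(prev) - 1)
        ])
    return columns


xs = [-2, 0, 3, 5, 7, 8]
ys = [-792, 108, -72, 48, -144, -252]
table = divided_difference_table(xs, ys)
```

Here `table[2][1]` is the second entry in the δ²-column and `table[4][0]` the first entry in the δ⁴-column.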

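24. Symmetry of Divided Differences. Divided differences are symmetric
functions of their arguments, as indicated below for particular cases.

\[ \delta(x_1, x_0) = \frac{y_1 - y_0}{x_1 - x_0} = \frac{y_0 - y_1}{x_0 - x_1} = \delta(x_0, x_1). \]

Also,

\[ \delta(x_1, x_0) = \frac{y_1}{x_1 - x_0} - \frac{y_0}{x_1 - x_0} = \frac{y_1}{x_1 - x_0} + \frac{y_0}{x_0 - x_1}. \]

\[ \delta(x_2, x_1, x_0) = \frac{\delta(x_2, x_1) - \delta(x_1, x_0)}{x_2 - x_0} \]
\[ \quad = \frac{1}{x_2 - x_0}\left[\frac{y_2}{x_2 - x_1} + \frac{y_1}{x_1 - x_2} - \left(\frac{y_1}{x_1 - x_0} + \frac{y_0}{x_0 - x_1}\right)\right] \]
\[ \quad = \frac{1}{x_2 - x_0}\left[\frac{y_2}{x_2 - x_1} + y_1\left(\frac{1}{x_1 - x_2} - \frac{1}{x_1 - x_0}\right) - \frac{y_0}{x_0 - x_1}\right] \]
\[ \quad = \frac{y_2}{(x_2 - x_0)(x_2 - x_1)} + \frac{y_1}{(x_1 - x_0)(x_1 - x_2)} + \frac{y_0}{(x_0 - x_1)(x_0 - x_2)}. \]

\[ \delta(x_3, x_2, x_1, x_0) = \frac{\delta(x_3, x_2, x_1) - \delta(x_2, x_1, x_0)}{x_3 - x_0} \]
\[ \quad = \frac{1}{x_3 - x_0}\left[\frac{y_3}{(x_3 - x_1)(x_3 - x_2)} + \frac{y_2}{(x_2 - x_1)(x_2 - x_3)} + \frac{y_1}{(x_1 - x_2)(x_1 - x_3)} \right. \]
\[ \qquad \left. - \left(\frac{y_2}{(x_2 - x_0)(x_2 - x_1)} + \frac{y_1}{(x_1 - x_0)(x_1 - x_2)} + \frac{y_0}{(x_0 - x_1)(x_0 - x_2)}\right)\right] \]
\[ \quad = \frac{1}{x_3 - x_0}\left[\frac{y_3}{(x_3 - x_1)(x_3 - x_2)} + y_2\left(\frac{1}{(x_2 - x_1)(x_2 - x_3)} - \frac{1}{(x_2 - x_0)(x_2 - x_1)}\right) \right. \]
\[ \qquad \left. + y_1\left(\frac{1}{(x_1 - x_2)(x_1 - x_3)} - \frac{1}{(x_1 - x_0)(x_1 - x_2)}\right) - \frac{y_0}{(x_0 - x_1)(x_0 - x_2)}\right], \]

or

\[ \delta(x_3, x_2, x_1, x_0) = \frac{y_3}{(x_3 - x_0)(x_3 - x_1)(x_3 - x_2)} + \frac{y_2}{(x_2 - x_0)(x_2 - x_1)(x_2 - x_3)} \]
\[ \qquad + \frac{y_1}{(x_1 - x_0)(x_1 - x_2)(x_1 - x_3)} + \frac{y_0}{(x_0 - x_1)(x_0 - x_2)(x_0 - x_3)}. \]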

The right-hand members of the above equations remain unchanged when
any two values of x are interchanged and the corresponding y’s are also
interchanged. This means that a divided difference remains unchanged
however its arguments are permuted. Thus,

\[ \delta(1, 6, 9) = \delta(6, 9, 1) = \delta(1, 9, 6) = \delta(6, 1, 9), \text{ etc.} \]

We may therefore write

\[ \delta(x_n, x_{n-1}, \ldots, x_2, x_1, x_0) = \delta(x_0, x_1, x_2, \ldots, x_n), \text{ etc.} \]


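It can be proved by mathematical induction that

(24.1)
\[ \delta(x, x_0, x_1, \ldots, x_n) = \frac{y}{(x - x_0)(x - x_1)\cdots(x - x_n)} + \frac{y_0}{(x_0 - x)(x_0 - x_1)\cdots(x_0 - x_n)} \]
\[ \qquad + \frac{y_1}{(x_1 - x)(x_1 - x_0)\cdots(x_1 - x_n)} + \cdots + \frac{y_n}{(x_n - x)(x_n - x_0)(x_n - x_1)\cdots(x_n - x_{n-1})}. \]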
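The symmetry property can be spot-checked numerically. The following is a small sketch, not a proof: `divided_difference` follows the recursive definition of Art. 22, `symmetric_sum` evaluates the explicit sum of terms $y_i / \prod_{j \ne i}(x_i - x_j)$, and both names, as well as the sample points taken from $y = x^3$, are assumptions made for the illustration.

```python
from fractions import Fraction
from itertools import permutations


def divided_difference(points):
    """Recursive definition; points is a sequence of (x, y) pairs."""
    if len(points) == 1:
        return Fraction(points[0][1])
    head = divided_difference(points[:-1])   # omits the last argument
    tail = divided_difference(points[1:])    # omits the first argument
    return (head - tail) / (Fraction(points[0][0]) - Fraction(points[-1][0]))


def symmetric_sum(points):
    """Explicit form: sum over i of y_i / product over j != i of (x_i - x_j)."""
    total = Fraction(0)
    for i, (xi, yi) in enumerate(points):
        denom = Fraction(1)
        for j, (xj, _) in enumerate(points):
            if j != i:
                denom *= Fraction(xi) - Fraction(xj)
        total += Fraction(yi) / denom
    return total


points = [(1, 1), (6, 216), (9, 729)]        # three points on y = x**3
values = {divided_difference(list(p)) for p in permutations(points)}
```

Every ordering of the arguments yields the same value, and that value coincides with the explicit symmetric sum.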

25. Relation Between Divided Differences and Simple Differences.
The relation between divided differences and simple differences can be 
found by starting with a set of functional values corresponding to equi- 
distant values of the argument, constructing a table of simple differences 
and a table of divided differences of these functional values, and then 
comparing differences of the same order in the two tables. For the simple 
difference table we have: 






And for the divided differences the table is: 




























From these tables we see, for example, that

\[ \delta^4 y_0 = \delta(x_0 + 4h,\ x_0 + 3h,\ x_0 + 2h,\ x_0 + h,\ x_0) = \frac{y_4 - 4y_3 + 6y_2 - 4y_1 + y_0}{4!\,h^4} = \frac{\Delta^4 y_0}{4!\,h^4}, \]

and more generally,

(25.1)
\[ \delta^n y_k = \delta(x_k + nh,\ x_k + (n-1)h,\ \ldots,\ x_k + h,\ x_k) = \frac{\Delta^n y_k}{n!\,h^n}, \qquad k = 0, 1, 2, \ldots \]

In Art. 19 it was shown that the nth simple differences of a polynomial
of the nth degree are constant. Since (25.1) is an nth-order simple
difference divided by the constant product $n!\,h^n$, it follows that the nth
divided differences of a polynomial of the nth degree are constant. Hence
the (n+1)th divided differences of a polynomial of the nth degree are
zero.
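Relation (25.1) is easy to confirm on a small example. The sketch below is only a check under stated assumptions: the helper names are hypothetical, the test function $y = x^4 - 3x$ and the spacing $h = 1/4$ are arbitrary choices, and exact fractions are used so that equality can be tested without rounding.

```python
from fractions import Fraction
from math import factorial


def forward_difference(ys, n, k=0):
    """Simple (ordinary) difference of order n at index k."""
    column = list(ys)
    for _ in range(n):
        column = [b - a for a, b in zip(column, column[1:])]
    return column[k]


def divided_difference(xs, ys):
    """Divided difference over all the given arguments (recursive definition)."""
    if len(xs) == 1:
        return ys[0]
    upper = divided_difference(xs[1:], ys[1:])
    lower = divided_difference(xs[:-1], ys[:-1])
    return (upper - lower) / (xs[-1] - xs[0])


h = Fraction(1, 4)
xs = [Fraction(1, 2) + i * h for i in range(6)]
ys = [x**4 - 3 * x for x in xs]

# For each order n, compare the divided difference with the scaled simple difference.
checks = [
    divided_difference(xs[: n + 1], ys[: n + 1])
    == forward_difference(ys, n) / (factorial(n) * h**n)
    for n in range(1, 6)
]
```

For this fourth-degree polynomial the fourth divided difference equals the leading coefficient, and the fifth vanishes, in agreement with the closing statement of the article.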


26. Newton’s General Interpolation Formula. Newton’s general inter- 
polation formula for unequal intervals of the argument can be derived 
by starting with any variable pair of values x and y and the pairs of given 
values, writing down the divided differences in ascending order, and then 
solving for y in successive steps until as many terms as desired are found. 
Thus: 


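(a) \[ \frac{y - y_0}{x - x_0} = \delta(x, x_0) \]

(b) \[ \frac{\delta(x, x_0) - \delta(x_0, x_1)}{x - x_1} = \delta(x, x_0, x_1) \]

(c) \[ \frac{\delta(x, x_0, x_1) - \delta(x_0, x_1, x_2)}{x - x_2} = \delta(x, x_0, x_1, x_2) \]

(d) \[ \frac{\delta(x, x_0, x_1, x_2) - \delta(x_0, x_1, x_2, x_3)}{x - x_3} = \delta(x, x_0, x_1, x_2, x_3) \]

(e) \[ \frac{\delta(x, x_0, x_1, x_2, x_3) - \delta(x_0, x_1, x_2, x_3, x_4)}{x - x_4} = \delta(x, x_0, x_1, x_2, x_3, x_4) \]

(f) \[ \frac{\delta(x, x_0, x_1, x_2, x_3, x_4) - \delta(x_0, x_1, x_2, x_3, x_4, x_5)}{x - x_5} = \delta(x, x_0, x_1, x_2, x_3, x_4, x_5). \]

From (a),

(g) \[ y = y_0 + (x - x_0)\,\delta(x, x_0). \]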



From (b),

\[ \delta(x, x_0) = \delta(x_0, x_1) + (x - x_1)\,\delta(x, x_0, x_1). \]

Substitute into (g) this value of $\delta(x, x_0)$ and get

(h)
\[ y = y_0 + (x - x_0)\left[\delta(x_0, x_1) + (x - x_1)\,\delta(x, x_0, x_1)\right] \]
\[ \quad = y_0 + (x - x_0)\,\delta(x_0, x_1) + (x - x_0)(x - x_1)\,\delta(x, x_0, x_1). \]

From (c),

\[ \delta(x, x_0, x_1) = \delta(x_0, x_1, x_2) + (x - x_2)\,\delta(x, x_0, x_1, x_2). \]

Substitute this into (h) and get

(i)
\[ y = y_0 + (x - x_0)\,\delta(x_0, x_1) + (x - x_0)(x - x_1)\left[\delta(x_0, x_1, x_2) + (x - x_2)\,\delta(x, x_0, x_1, x_2)\right] \]
\[ \quad = y_0 + (x - x_0)\,\delta(x_0, x_1) + (x - x_0)(x - x_1)\,\delta(x_0, x_1, x_2) + (x - x_0)(x - x_1)(x - x_2)\,\delta(x, x_0, x_1, x_2). \]

From (d),

\[ \delta(x, x_0, x_1, x_2) = \delta(x_0, x_1, x_2, x_3) + (x - x_3)\,\delta(x, x_0, x_1, x_2, x_3). \]

Substitute into (i) and get

(j)
\[ y = y_0 + (x - x_0)\,\delta(x_0, x_1) + (x - x_0)(x - x_1)\,\delta(x_0, x_1, x_2) \]
\[ \qquad + (x - x_0)(x - x_1)(x - x_2)\left[\delta(x_0, x_1, x_2, x_3) + (x - x_3)\,\delta(x, x_0, x_1, x_2, x_3)\right] \]
\[ \quad = y_0 + (x - x_0)\,\delta(x_0, x_1) + (x - x_0)(x - x_1)\,\delta(x_0, x_1, x_2) + (x - x_0)(x - x_1)(x - x_2)\,\delta(x_0, x_1, x_2, x_3) \]
\[ \qquad + (x - x_0)(x - x_1)(x - x_2)(x - x_3)\,\delta(x, x_0, x_1, x_2, x_3). \]

From (e),

\[ \delta(x, x_0, x_1, x_2, x_3) = \delta(x_0, x_1, x_2, x_3, x_4) + (x - x_4)\,\delta(x, x_0, x_1, x_2, x_3, x_4). \]

Substitute this into (j) and obtain

(k)
\[ y = y_0 + (x - x_0)\,\delta(x_0, x_1) + (x - x_0)(x - x_1)\,\delta(x_0, x_1, x_2) + (x - x_0)(x - x_1)(x - x_2)\,\delta(x_0, x_1, x_2, x_3) \]
\[ \qquad + (x - x_0)(x - x_1)(x - x_2)(x - x_3)\,\delta(x_0, x_1, x_2, x_3, x_4) \]
\[ \qquad + (x - x_0)(x - x_1)(x - x_2)(x - x_3)(x - x_4)\,\delta(x, x_0, x_1, x_2, x_3, x_4). \]

By continuing in this manner, or by mathematical induction, it can be 
proved that the general Newton formula with divided differences is 



(26.1)
\[ y = y_0 + (x - x_0)\,\delta(x_0, x_1) + (x - x_0)(x - x_1)\,\delta(x_0, x_1, x_2) + (x - x_0)(x - x_1)(x - x_2)\,\delta(x_0, x_1, x_2, x_3) + \cdots \]
\[ \qquad + (x - x_0)(x - x_1)\cdots(x - x_{n-1})\,\delta(x_0, x_1, x_2, \ldots, x_n) \]
\[ \qquad + (x - x_0)(x - x_1)\cdots(x - x_n)\,\delta(x, x_0, x_1, x_2, \ldots, x_n). \]

The last term in this formula is the remainder term after n + 1 terms,
or $R_{n+1}$. Hence

(26.2)
\[ R_{n+1} = (x - x_0)(x - x_1)(x - x_2)\cdots(x - x_n)\,\delta(x, x_0, x_1, x_2, \ldots, x_n). \]

Here it seems worthwhile to state that all terms in the right-hand
member of (26.1) have the dimensions of y, regardless of the nature of x
and of the units in which x is expressed. The x-units or x-dimensions
always cancel out. For example, the second term in the right-hand member
of (26.1) is

\[ (x - x_0)\,\delta(x_0, x_1) = (x - x_0)\,\frac{y_1 - y_0}{x_1 - x_0}, \]

in which the x-dimension in the numerator cancels that in the denominator.
The third term of (26.1) is

\[ (x - x_0)(x - x_1)\,\delta(x_0, x_1, x_2) = \frac{(x - x_0)(x - x_1)}{x_2 - x_0}\left[\frac{y_2 - y_1}{x_2 - x_1} - \frac{y_1 - y_0}{x_1 - x_0}\right]. \]

Here the x-dimension is of the second degree in both numerator and
denominator and therefore cancels out, leaving the term in the dimensions
of y alone.

The disappearance of the dimensions of x in the terms of (26.1) means
that x may be of any nature and expressed in whatever units may be
required. The x’s require no transformation before substitution in (26.1).

Example 1. The following table gives certain corresponding values of
x and $\log_{10} x$. Find log 323.5 by Newton’s general formula (26.1).

    x       log x        δ             δ²             δ³
  321.0    2.50651
                      0.00134444
  322.8    2.50893                  -0.00000049
                      0.00134286                  -0.00000049
  324.2    2.51081                  -0.00000244
                      0.00133750
  325.0    2.51188


Solution. Here $x_0 = 322.8$, $y_0 = 2.50893$, $x_1 = 324.2$, $x_2 = 325.0$,
$x = 323.5$. Hence

\[ y = \log 323.5 = 2.50893 + (323.5 - 322.8)(0.00134286) + (323.5 - 322.8)(323.5 - 324.2)(-0.00000244) \]
\[ \quad = 2.50893 + 0.000940 + 0.0000012 \]
\[ \quad = 2.50987. \]

This result is correct to its last figure. 
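A compact sketch of formula (26.1) follows. The name `newton_divided` is hypothetical; the routine computes the leading divided differences in place and evaluates the polynomial by nested multiplication. Unlike the hand solution, which starts at 322.8, it uses all four tabular points, which is an assumption of the sketch rather than the method of the example.

```python
import math


def newton_divided(xs, ys, x):
    """Evaluate Newton's general (divided-difference) formula at x."""
    n = len(xs)
    coeffs = list(ys)
    for order in range(1, n):
        # Work from the bottom up so lower-order entries are still available.
        for i in range(n - 1, order - 1, -1):
            coeffs[i] = (coeffs[i] - coeffs[i - 1]) / (xs[i] - xs[i - order])
    # coeffs[i] is now the divided difference on x_0, ..., x_i.
    result = coeffs[-1]
    for i in range(n - 2, -1, -1):
        result = result * (x - xs[i]) + coeffs[i]
    return result


xs = [321.0, 322.8, 324.2, 325.0]
ys = [2.50651, 2.50893, 2.51081, 2.51188]
log_323_5 = newton_divided(xs, ys, 323.5)
```

Since the x-dimensions cancel in every term, the arguments can be passed in whatever units the table uses.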

Example 2. Find csc 25°.167 by means of (26.1) and the following
table.

    θ        csc θ          δ               δ²               δ³
  25°.05   2.36178256
                        -0.087906125
  25°.13   2.35475007                    0.003613125
                        -0.087183500                    -0.000141907
  25°.25   2.34428805                    0.0035748105
                        -0.086504286                    -0.0001476607
  25°.32   2.33823275                    0.0035351188
                        -0.085938667
  25°.41   2.33049827

Solution. Here $x_0 = 25°.13$, $y_0 = 2.35475007$, $x_1 = 25°.25$, $x_2 = 25°.32$.
Then from (26.1) we have

\[ \csc x = 2.35475007 + 0.037\,(-0.0871835) + 0.037\,(-0.083)(0.00357481) + 0.037\,(-0.083)(-0.153)(-0.000147661) \]
\[ \quad = 2.35475007 - 0.003225790 - 0.000010978 - 0.000000069 \]
\[ \quad = 2.35151323. \]

27. Lagrange’s Interpolation Formula. Let $f(x)$ denote a polynomial
of the nth degree which takes the values $y_0, y_1, y_2, \ldots, y_n$ when x has the
values $x_0, x_1, x_2, \ldots, x_n$, respectively. Then the (n+1)th divided differences of this
polynomial are zero (Art. 25). Hence $\delta(x, x_0, x_1, x_2, \ldots, x_n) = 0$ and
(24.1) becomes

\[ \frac{y}{(x - x_0)(x - x_1)(x - x_2)\cdots(x - x_n)} + \frac{y_0}{(x_0 - x)(x_0 - x_1)(x_0 - x_2)\cdots(x_0 - x_n)} \]
\[ \qquad + \frac{y_1}{(x_1 - x)(x_1 - x_0)(x_1 - x_2)\cdots(x_1 - x_n)} + \cdots + \frac{y_n}{(x_n - x)(x_n - x_0)(x_n - x_1)\cdots(x_n - x_{n-1})} = 0. \]

Transposing to the right-hand side all terms except the first, we have

\[ \frac{y}{(x - x_0)(x - x_1)(x - x_2)\cdots(x - x_n)} = \frac{y_0}{(x - x_0)(x_0 - x_1)(x_0 - x_2)\cdots(x_0 - x_n)} \]
\[ \qquad + \frac{y_1}{(x - x_1)(x_1 - x_0)(x_1 - x_2)\cdots(x_1 - x_n)} + \cdots + \frac{y_n}{(x - x_n)(x_n - x_0)(x_n - x_1)\cdots(x_n - x_{n-1})}. \]

Solving for y and then canceling the common factors $(x - x_0)$, $(x - x_1)$,
$(x - x_2)$, ..., $(x - x_n)$ in the several terms, we get

(27.1)
\[ y = \frac{(x - x_1)(x - x_2)\cdots(x - x_n)}{(x_0 - x_1)(x_0 - x_2)\cdots(x_0 - x_n)}\,y_0 + \frac{(x - x_0)(x - x_2)\cdots(x - x_n)}{(x_1 - x_0)(x_1 - x_2)\cdots(x_1 - x_n)}\,y_1 \]
\[ \qquad + \frac{(x - x_0)(x - x_1)(x - x_3)\cdots(x - x_n)}{(x_2 - x_0)(x_2 - x_1)(x_2 - x_3)\cdots(x_2 - x_n)}\,y_2 + \cdots + \frac{(x - x_0)(x - x_1)(x - x_2)\cdots(x - x_{n-1})}{(x_n - x_0)(x_n - x_1)(x_n - x_2)\cdots(x_n - x_{n-1})}\,y_n. \]

This is Lagrange’s formula and is seen to give $y = y_0, y_1, \ldots, y_n$ when
$x = x_0, x_1, \ldots, x_n$, respectively. The values of the independent variable
may or may not be equidistant.

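The Lagrange formula can also be written in the form

(27.2)
\[ y = \sum_{i=0}^{n} \frac{\Pi(x)}{(x - x_i)\,\Pi'(x_i)}\,y_i, \]

where $\Pi(x) = (x - x_0)(x - x_1)\cdots(x - x_n)$ and $\Pi'(x_i)$ is the derivative of $\Pi(x)$ evaluated at $x = x_i$.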


Since Lagrange’s formula is merely a relation between two variables, 
either of which may be taken as the independent variable, it is evident 
that by considering y as the independent variable we can write a formula 
giving X as a function of y. Hence on interchanging x and y in (27. 1), 
we get 


(27.3)
\[ x = \frac{(y - y_1)(y - y_2)\cdots(y - y_n)}{(y_0 - y_1)(y_0 - y_2)\cdots(y_0 - y_n)}\,x_0 + \frac{(y - y_0)(y - y_2)\cdots(y - y_n)}{(y_1 - y_0)(y_1 - y_2)\cdots(y_1 - y_n)}\,x_1 \]
\[ \qquad + \frac{(y - y_0)(y - y_1)(y - y_3)\cdots(y - y_n)}{(y_2 - y_0)(y_2 - y_1)(y_2 - y_3)\cdots(y_2 - y_n)}\,x_2 + \cdots + \frac{(y - y_0)(y - y_1)\cdots(y - y_{n-1})}{(y_n - y_0)(y_n - y_1)\cdots(y_n - y_{n-1})}\,x_n. \]

The chief uses of Lagrange’s formula are two: (1) to find any value of
a function when the given values of the independent variable are not
equidistant, and (2) to find the value of the independent variable
corresponding to a given value of the function. This second problem is solved
by means of formula (27.3).

We shall now work two examples to illustrate these uses. 


Example 1. The following table gives certain corresponding values of
x and $\log_{10} x$. Compute the value of log 323.5.

  x             321.0     322.8     324.2     325.0
  log_10 x      2.50651   2.50893   2.51081   2.51188

Solution. Here $x = 323.5$, $x_0 = 321.0$, $x_1 = 322.8$, $x_2 = 324.2$, $x_3 = 325.0$.
Substituting these values in (27.1), we get

\[ \log_{10} 323.5 = \frac{(323.5 - 322.8)(323.5 - 324.2)(323.5 - 325.0)}{(321 - 322.8)(321 - 324.2)(321 - 325)} \times 2.50651 \]
\[ \qquad + \frac{(323.5 - 321)(323.5 - 324.2)(323.5 - 325)}{(322.8 - 321)(322.8 - 324.2)(322.8 - 325)} \times 2.50893 \]
\[ \qquad + \frac{(323.5 - 321)(323.5 - 322.8)(323.5 - 325)}{(324.2 - 321)(324.2 - 322.8)(324.2 - 325)} \times 2.51081 \]
\[ \qquad + \frac{(323.5 - 321)(323.5 - 322.8)(323.5 - 324.2)}{(325 - 321)(325 - 322.8)(325 - 324.2)} \times 2.51188 \]
\[ \quad = -0.07996 + 1.18794 + 1.83897 - 0.43708 \]
\[ \quad = 2.50987. \]


This result is correct to the last figure. 

The above example has already been worked in Art. 26 by formula 
(26. 1). The student should compare the amount of work in the two cases. 
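For comparison with the divided-difference computation, formula (27.1) can be sketched directly. The function name `lagrange` is hypothetical; the routine simply forms each ratio of products and accumulates the weighted ordinates, exactly as in the worked example.

```python
def lagrange(xs, ys, x):
    """Evaluate Lagrange's formula (27.1) at x; xs need not be equidistant."""
    total = 0.0
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        weight = 1.0
        for j, xj in enumerate(xs):
            if j != i:
                weight *= (x - xj) / (xi - xj)   # one factor of the i-th coefficient
        total += weight * yi
    return total


xs = [321.0, 322.8, 324.2, 325.0]
ys = [2.50651, 2.50893, 2.51081, 2.51188]
log_323_5 = lagrange(xs, ys, 323.5)
```

No difference table is needed, but every new argument x requires all the products to be formed afresh, which is the extra labor the text refers to.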


Example 2. The following table gives the values of the probability
integral $(2/\sqrt{\pi})\int_0^x e^{-x^2}\,dx$ corresponding to certain values of x. For what
value of x is this integral equal to 1/2?

  (2/√π)∫_0^x e^{-x²} dx       x
       0.4846555              0.46
       0.4937452              0.47
       0.5027498              0.48
       0.5116683              0.49


Solution. Calling y the value of the probability integral, we have
$y = \tfrac{1}{2} = 0.5$, $x_0 = 0.46$, $x_1 = 0.47$, $x_2 = 0.48$, $x_3 = 0.49$.

Substituting these in (27.3), we get

\[ x = \frac{(0.5 - 0.4937452)(0.5 - 0.5027498)(0.5 - 0.5116683)}{(0.4846555 - 0.4937452)(0.4846555 - 0.5027498)(0.4846555 - 0.5116683)} \times 0.46 \]
\[ \qquad + \frac{(0.5 - 0.4846555)(0.5 - 0.5027498)(0.5 - 0.5116683)}{(0.4937452 - 0.4846555)(0.4937452 - 0.5027498)(0.4937452 - 0.5116683)} \times 0.47 \]
\[ \qquad + \frac{(0.5 - 0.4846555)(0.5 - 0.4937452)(0.5 - 0.5116683)}{(0.5027498 - 0.4846555)(0.5027498 - 0.4937452)(0.5027498 - 0.5116683)} \times 0.48 \]
\[ \qquad + \frac{(0.5 - 0.4846555)(0.5 - 0.4937452)(0.5 - 0.5027498)}{(0.5116683 - 0.4846555)(0.5116683 - 0.4937452)(0.5116683 - 0.5027498)} \times 0.49 \]
\[ \quad = -\frac{62548 \times 27498 \times 116683}{90897 \times 180943 \times 270128} \times 0.46 + \frac{153445 \times 27498 \times 116683}{90897 \times 90046 \times 179231} \times 0.47 \]
\[ \qquad + \frac{153445 \times 62548 \times 116683}{180943 \times 90046 \times 89185} \times 0.48 - \frac{153445 \times 62548 \times 27498}{270128 \times 179231 \times 89185} \times 0.49 \]
\[ \quad = -0.0207787 + 0.157737 + 0.369928 - 0.0299496 \]
\[ \quad = 0.476937. \]

The true value to six decimal places is 0.476936.

Note. The computation in this problem should be performed by 
logarithms unless a calculating machine is available. 
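With a machine the same inverse interpolation takes only a few lines. The sketch below applies (27.3) by interchanging the roles of the two columns; the function name `lagrange` is hypothetical, and the check against `math.erf` relies on the fact that the tabulated probability integral is the error function.

```python
import math


def lagrange(nodes, values, t):
    """Lagrange interpolation: the polynomial through (nodes, values) evaluated at t."""
    total = 0.0
    for i, (ni, vi) in enumerate(zip(nodes, values)):
        weight = 1.0
        for j, nj in enumerate(nodes):
            if j != i:
                weight *= (t - nj) / (ni - nj)
        total += weight * vi
    return total


ys = [0.4846555, 0.4937452, 0.5027498, 0.5116683]   # values of the integral
xs = [0.46, 0.47, 0.48, 0.49]                        # corresponding arguments

# Formula (27.3): treat y as the independent variable and interpolate for x.
x_half = lagrange(ys, xs, 0.5)
```

Substituting `x_half` back into the integral is a convenient independent check on the result.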

Remark. The reader who has followed through the computation in
the two preceding examples will have noticed that Lagrange’s formula
is tedious to apply and involves a great deal of computation. It must
also be used with care and caution, for if the values of the independent
variable are not taken close together the results are liable to be very
inaccurate.


EXERCISES III

1. Using formula (26.1), find from the following table the value of y
for x = 5.60276.

  x    5.600        5.602        5.605        5.607        5.608
  y    0.77556588   0.77682686   0.77871250   0.77996571   0.78059114


2. The following table gives some relations between steam pressure and
temperature. Find the pressure at temperature 372°.1.


T 

361* 

367* 

378* 

09 

00 

O 

1 399* 

P 

164.9 

167.0 

191.0 

212.6 

1 244.2 


3. Find from the following table the value of y for x = 0.632762.


X 

0.6306 

0.6342 

0.6367 

0.6382 

y 

1 0.67i 

1112024 1 
1 0.6416 

0.677676761 

1 0.6462 

0.679389428 

0.682413940 


0.686412828 j 0.692121131 



4. From the data in the following table find by Lagrange’s formulas the
value of y when x = 102 and the value of x when y = 13.6.


X 

V 

93.0 

11.88 

06.2 

* 12.80 


14.70 

104.2 

17.07 

108.7 

19.91 


CHAPTER IV


CENTRAL-DIFFERENCE INTERPOLATION FORMULAS

28. Introduction. Newton’s formulas (I) and (II), derived and illustrated
in Chapter II, are best suited for interpolation near the beginning
and end, respectively, of a table of differences. For interpolation near
the middle of a difference table, central-difference formulas are preferable.
These formulas employ differences lying as nearly as possible on a horizontal
line through $y_0$ in a diagonal difference table.

The most important central-difference formulas are the two known as
Stirling’s and Bessel’s. We shall derive them by first deriving three other
central-difference formulas and then taking the means of the latter in pairs.

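29. Gauss’s Central-Difference Formulas.

(a). Gauss’s forward formula. In the general Newton formula (26.1),
put $x_0 = x_0$, $x_1 = x_0 + h$, $x_2 = x_0 - h$, $x_3 = x_0 + 2h$, $x_4 = x_0 - 2h$,
$x_5 = x_0 + 3h$, $x_6 = x_0 - 3h$.

Then we have

(29.1)
\[ y = y_0 + (x - x_0)\,\delta(x_0, x_0 + h) + (x - x_0)(x - x_0 - h)\,\delta(x_0, x_0 + h, x_0 - h) \]
\[ \qquad + (x - x_0)(x - x_0 - h)(x - x_0 + h)\,\delta(x_0, x_0 + h, x_0 - h, x_0 + 2h) \]
\[ \qquad + (x - x_0)(x - x_0 - h)(x - x_0 + h)(x - x_0 - 2h)\,\delta(x_0, x_0 + h, x_0 - h, x_0 + 2h, x_0 - 2h) \]
\[ \qquad + (x - x_0)(x - x_0 - h)(x - x_0 + h)(x - x_0 - 2h)(x - x_0 + 2h)\,\delta(x_0, x_0 + h, x_0 - h, x_0 + 2h, x_0 - 2h, x_0 + 3h). \]

Now put $u = (x - x_0)/h$, or $x - x_0 = hu$. Then

(29.2)
\[ y = y_0 + hu\,\delta(x_0, x_0 + h) + hu(hu - h)\,\delta(x_0 - h, x_0, x_0 + h) \]
\[ \qquad + hu(hu - h)(hu + h)\,\delta(x_0 - h, x_0, x_0 + h, x_0 + 2h) \]
\[ \qquad + hu(hu - h)(hu + h)(hu - 2h)\,\delta(x_0 - 2h, x_0 - h, x_0, x_0 + h, x_0 + 2h) \]
\[ \qquad + hu(hu - h)(hu + h)(hu - 2h)(hu + 2h)\,\delta(x_0 - 2h, x_0 - h, x_0, x_0 + h, x_0 + 2h, x_0 + 3h). \]

But, by (25.1),

\[ \delta(x_0, x_0 + h) = \frac{\Delta y_0}{h}, \qquad \delta(x_0 - h, x_0, x_0 + h) = \frac{\Delta^2 y_{-1}}{2!\,h^2}, \qquad \delta(x_0 - h, x_0, x_0 + h, x_0 + 2h) = \frac{\Delta^3 y_{-1}}{3!\,h^3}, \]
\[ \delta(x_0 - 2h, x_0 - h, x_0, x_0 + h, x_0 + 2h) = \frac{\Delta^4 y_{-2}}{4!\,h^4}, \qquad \delta(x_0 - 2h, x_0 - h, x_0, x_0 + h, x_0 + 2h, x_0 + 3h) = \frac{\Delta^5 y_{-2}}{5!\,h^5}. \]

Substituting these into (29.2) and canceling the powers of h in the several terms, we get

(29.3)
\[ y = y_0 + u\,\Delta y_0 + \frac{u(u - 1)}{2!}\Delta^2 y_{-1} + \frac{u(u^2 - 1)}{3!}\Delta^3 y_{-1} + \frac{u(u^2 - 1)(u - 2)}{4!}\Delta^4 y_{-2} + \frac{u(u^2 - 1)(u^2 - 2^2)}{5!}\Delta^5 y_{-2}. \]

This is the Gauss forward formula.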

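(b). Gauss’s backward formula. This time we make the following substitution in (26.1):
$x_0 = x_0$, $x_1 = x_0 - h$, $x_2 = x_0 + h$, $x_3 = x_0 - 2h$, $x_4 = x_0 + 2h$, $x_5 = x_0 - 3h$.

Then (26.1) becomes

(29.4)
\[ y = y_0 + (x - x_0)\,\delta(x_0, x_0 - h) + (x - x_0)(x - x_0 + h)\,\delta(x_0, x_0 - h, x_0 + h) \]
\[ \qquad + (x - x_0)(x - x_0 + h)(x - x_0 - h)\,\delta(x_0, x_0 - h, x_0 + h, x_0 - 2h) \]
\[ \qquad + (x - x_0)(x - x_0 + h)(x - x_0 - h)(x - x_0 + 2h)\,\delta(x_0, x_0 - h, x_0 + h, x_0 - 2h, x_0 + 2h) \]
\[ \qquad + (x - x_0)(x - x_0 + h)(x - x_0 - h)(x - x_0 + 2h)(x - x_0 - 2h)\,\delta(x_0, x_0 - h, x_0 + h, x_0 - 2h, x_0 + 2h, x_0 - 3h). \]

Now put $u = (x - x_0)/h$, or $x - x_0 = hu$. Then

(29.5)
\[ y = y_0 + hu\,\delta(x_0 - h, x_0) + hu(hu + h)\,\delta(x_0 - h, x_0, x_0 + h) \]
\[ \qquad + hu(hu + h)(hu - h)\,\delta(x_0 - 2h, x_0 - h, x_0, x_0 + h) \]
\[ \qquad + hu(hu + h)(hu - h)(hu + 2h)\,\delta(x_0 - 2h, x_0 - h, x_0, x_0 + h, x_0 + 2h) \]
\[ \qquad + hu(hu + h)(hu - h)(hu + 2h)(hu - 2h)\,\delta(x_0 - 3h, x_0 - 2h, x_0 - h, x_0, x_0 + h, x_0 + 2h). \]

But

\[ \delta(x_0 - h, x_0) = \frac{\Delta y_{-1}}{h}, \qquad \delta(x_0 - h, x_0, x_0 + h) = \frac{\Delta^2 y_{-1}}{2!\,h^2}, \qquad \delta(x_0 - 2h, x_0 - h, x_0, x_0 + h) = \frac{\Delta^3 y_{-2}}{3!\,h^3}, \]
\[ \delta(x_0 - 2h, x_0 - h, x_0, x_0 + h, x_0 + 2h) = \frac{\Delta^4 y_{-2}}{4!\,h^4}, \qquad \delta(x_0 - 3h, x_0 - 2h, x_0 - h, x_0, x_0 + h, x_0 + 2h) = \frac{\Delta^5 y_{-3}}{5!\,h^5}. \]

Substituting these into (29.5) and canceling the powers of h in the several
terms, we get

(29.6)
\[ y = y_0 + u\,\Delta y_{-1} + \frac{u(u + 1)}{2!}\Delta^2 y_{-1} + \frac{u(u^2 - 1)}{3!}\Delta^3 y_{-2} + \frac{u(u^2 - 1)(u + 2)}{4!}\Delta^4 y_{-2} + \frac{u(u^2 - 1)(u^2 - 2^2)}{5!}\Delta^5 y_{-3}, \]

which is the Gauss backward formula.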

(c). A third Gauss formula. For the derivation of Bessel’s formula
we need a third central-difference formula that starts with $y_1$ and runs
parallel to the backward formula (29.6). To derive such a formula we
advance the subscripts of x and y in (29.4) by one unit, remember that
$x_1 - h = x_0$, and change the u’s by putting $k = 1$ in the general formula

\[ u - k = \frac{x - x_k}{h}, \]

and thus get the relation

\[ u - 1 = \frac{x - x_1}{h}, \quad \text{or} \quad x - x_1 = hu - h. \]

These changes amount to advancing all subscripts in (29.6) by one unit
and replacing u by u - 1. These changes then reduce (29.6) to

(29.7)
\[ y = y_1 + (u - 1)\Delta y_0 + \frac{u(u - 1)}{2!}\Delta^2 y_0 + \frac{u(u - 1)(u - 2)}{3!}\Delta^3 y_{-1} + \frac{u(u^2 - 1)(u - 2)}{4!}\Delta^4 y_{-1} + \frac{u(u^2 - 1)(u - 2)(u - 3)}{5!}\Delta^5 y_{-2}, \]

which is the desired formula.

The following table shows the paths of the three Gauss formulas across
a diagonal difference table. The designations $G_1$, $G_2$, $G_3$ refer to formulas
(29.3), (29.6), and (29.7), respectively.

[Table: diagonal difference table with columns x, y, Δy, Δ²y, Δ³y, Δ⁴y, Δ⁵y, showing the paths of G_1, G_2, and G_3.]



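30. Stirling’s Interpolation Formula. Taking the mean of formulas
(29.3) and (29.6), by adding them and dividing the sums throughout
by 2, we get

\[ y = y_0 + u\,\frac{\Delta y_{-1} + \Delta y_0}{2} + \frac{u^2}{2}\Delta^2 y_{-1} + \frac{u(u^2 - 1)}{3!}\,\frac{\Delta^3 y_{-2} + \Delta^3 y_{-1}}{2} \]
\[ \qquad + \frac{u^2(u^2 - 1)}{4!}\Delta^4 y_{-2} + \frac{u(u^2 - 1)(u^2 - 2^2)}{5!}\,\frac{\Delta^5 y_{-3} + \Delta^5 y_{-2}}{2}, \]

which is Stirling’s formula. Note that it goes horizontally through $y_0$. In
its more general form Stirling’s formula is

(III)
\[ y = y_0 + u\,\frac{\Delta y_{-1} + \Delta y_0}{2} + \frac{u^2}{2!}\Delta^2 y_{-1} + \frac{u(u^2 - 1^2)}{3!}\,\frac{\Delta^3 y_{-2} + \Delta^3 y_{-1}}{2} + \frac{u^2(u^2 - 1^2)}{4!}\Delta^4 y_{-2} \]
\[ \qquad + \frac{u(u^2 - 1^2)(u^2 - 2^2)}{5!}\,\frac{\Delta^5 y_{-3} + \Delta^5 y_{-2}}{2} + \frac{u^2(u^2 - 1^2)(u^2 - 2^2)}{6!}\Delta^6 y_{-3} + \cdots \]
\[ \qquad + \frac{u(u^2 - 1^2)(u^2 - 2^2)(u^2 - 3^2)\cdots[u^2 - (n-1)^2]}{(2n - 1)!}\,\frac{\Delta^{2n-1} y_{-n} + \Delta^{2n-1} y_{-(n-1)}}{2} \]
\[ \qquad + \frac{u^2(u^2 - 1^2)(u^2 - 2^2)(u^2 - 3^2)\cdots[u^2 - (n-1)^2]}{(2n)!}\Delta^{2n} y_{-n}, \]

where $u = (x - x_0)/h$.

In this formula there are 2n + 1 terms, and the polynomial coincides
with the given function at the 2n + 1 points

\[ u = -n,\ -(n-1),\ -(n-2),\ \ldots,\ -2,\ -1,\ 0,\ 1,\ 2,\ \ldots,\ n-2,\ n-1,\ n; \]

or

\[ x = x_0 - nh,\ x_0 - (n-1)h,\ \ldots,\ x_0 - h,\ x_0,\ x_0 + h,\ \ldots,\ x_0 + (n-1)h,\ x_0 + nh. \]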

The path of Stirling’s formula across a diagonal difference table is
shown in the table below. The quantities that occur in the formula are
printed in heavy type.

[Table 7: diagonal difference table for y_{-4}, ..., y_4 with columns Δy through Δ⁶y. The entries used by Stirling’s formula are y_0, the even-order differences Δ²y_{-1}, Δ⁴y_{-2}, Δ⁶y_{-3} on the horizontal line through y_0, and the pairs of odd-order differences immediately above and below that line.]


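31. Bessel’s Interpolation Formulas. On taking the mean of (29.3)
and (29.7), by adding them and dividing by 2, we obtain

(31.1)
\[ y = \frac{y_0 + y_1}{2} + \left(u - \tfrac{1}{2}\right)\Delta y_0 + \frac{u(u - 1)}{2!}\,\frac{\Delta^2 y_{-1} + \Delta^2 y_0}{2} + \frac{u\left(u - \tfrac{1}{2}\right)(u - 1)}{3!}\Delta^3 y_{-1} \]
\[ \qquad + \frac{u(u^2 - 1)(u - 2)}{4!}\,\frac{\Delta^4 y_{-2} + \Delta^4 y_{-1}}{2} + \cdots \]

This is one form of Bessel’s formula. It follows a horizontal line midway
between $y_0$ and $y_1$ in the difference table.

The above formula is frequently written in a slightly different form.
Since $y_1 - y_0 = \Delta y_0$, the first two terms can be transformed to $y_0 + u\,\Delta y_0$.

Then (31.1) becomes

(31.2)
\[ y = y_0 + u\,\Delta y_0 + \frac{u(u - 1)}{2!}\,\frac{\Delta^2 y_{-1} + \Delta^2 y_0}{2} + \frac{u\left(u - \tfrac{1}{2}\right)(u - 1)}{3!}\Delta^3 y_{-1} \]
\[ \qquad + \frac{u(u^2 - 1)(u - 2)}{4!}\,\frac{\Delta^4 y_{-2} + \Delta^4 y_{-1}}{2} + \cdots \]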

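The general form of Bessel’s formula is

(IV)
\[ y = y_0 + u\,\Delta y_0 + \frac{u(u - 1)}{2!}\,\frac{\Delta^2 y_{-1} + \Delta^2 y_0}{2} + \frac{u\left(u - \tfrac{1}{2}\right)(u - 1)}{3!}\Delta^3 y_{-1} \]
\[ \qquad + \frac{u(u^2 - 1)(u - 2)}{4!}\,\frac{\Delta^4 y_{-2} + \Delta^4 y_{-1}}{2} + \frac{u\left(u - \tfrac{1}{2}\right)(u^2 - 1)(u - 2)}{5!}\Delta^5 y_{-2} \]
\[ \qquad + \frac{u(u^2 - 1)(u^2 - 4)(u - 3)}{6!}\,\frac{\Delta^6 y_{-3} + \Delta^6 y_{-2}}{2} + \cdots \]
\[ \qquad + \frac{u(u^2 - 1)(u^2 - 4)\cdots(u - n)(u + n - 1)}{(2n)!}\,\frac{\Delta^{2n} y_{-n} + \Delta^{2n} y_{-n+1}}{2} \]
\[ \qquad + \frac{u\left(u - \tfrac{1}{2}\right)(u^2 - 1)(u^2 - 4)\cdots(u - n)(u + n - 1)}{(2n + 1)!}\Delta^{2n+1} y_{-n}. \]

If we put $u = \tfrac{1}{2}$ in (IV), we get the simple formula

(V)
\[ y = \frac{y_0 + y_1}{2} - \frac{1}{8}\,\frac{\Delta^2 y_{-1} + \Delta^2 y_0}{2} + \frac{3}{128}\,\frac{\Delta^4 y_{-2} + \Delta^4 y_{-1}}{2} - \frac{5}{1024}\,\frac{\Delta^6 y_{-3} + \Delta^6 y_{-2}}{2} \]
\[ \qquad + \cdots + (-1)^n\,\frac{[1 \cdot 3 \cdot 5 \cdots (2n - 1)]^2}{2^{2n}(2n)!}\,\frac{\Delta^{2n} y_{-n} + \Delta^{2n} y_{-n+1}}{2}. \]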

This important special case of Bessel’s formula is called the formula for 
interpolating to halves. It is used for computing values of the function 
midway between any two given values. 
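A minimal sketch of formula (V), truncated after the fourth-difference term, is given below. The function name `interpolate_to_halves` and the index convention (the value is sought midway between `ys[i]` and `ys[i + 1]`) are assumptions of the sketch; the test function $y = x^4$ is chosen because a formula carried to fourth differences reproduces a quartic exactly.

```python
def interpolate_to_halves(ys, i):
    """Value midway between ys[i] and ys[i + 1] by formula (V), kept through
    fourth differences. Requires the entries ys[i - 2] ... ys[i + 3]."""
    if i < 2 or i + 3 >= len(ys):
        raise ValueError("need two tabular values before y_0 and two after y_1")

    def diff(order, k):
        """Simple difference of the given order at index k."""
        column = list(ys)
        for _ in range(order):
            column = [b - a for a, b in zip(column, column[1:])]
        return column[k]

    mean_y = (ys[i] + ys[i + 1]) / 2
    mean_d2 = (diff(2, i - 1) + diff(2, i)) / 2      # second differences of y_{-1} and y_0
    mean_d4 = (diff(4, i - 2) + diff(4, i - 1)) / 2  # fourth differences of y_{-2} and y_{-1}
    return mean_y - mean_d2 / 8 + 3 * mean_d4 / 128


ys = [x**4 for x in range(6)]          # y at x = 0, 1, ..., 5
midpoint = interpolate_to_halves(ys, 2)  # value at x = 2.5
```

Only even-order differences enter, since every odd-order term of (IV) carries the factor $u - \tfrac{1}{2}$ and therefore vanishes at the midpoint.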

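A more symmetrical and convenient form of Bessel’s formula is obtained
by putting $u - \tfrac{1}{2} = v$, or $u = v + \tfrac{1}{2}$. Making this substitution in (IV),
we get

(VI)
\[ y = \frac{y_0 + y_1}{2} + v\,\Delta y_0 + \frac{v^2 - \tfrac{1}{4}}{2!}\,\frac{\Delta^2 y_{-1} + \Delta^2 y_0}{2} + \frac{v\left(v^2 - \tfrac{1}{4}\right)}{3!}\Delta^3 y_{-1} \]
\[ \qquad + \frac{\left(v^2 - \tfrac{1}{4}\right)\left(v^2 - \tfrac{9}{4}\right)}{4!}\,\frac{\Delta^4 y_{-2} + \Delta^4 y_{-1}}{2} + \frac{v\left(v^2 - \tfrac{1}{4}\right)\left(v^2 - \tfrac{9}{4}\right)}{5!}\Delta^5 y_{-2} \]
\[ \qquad + \frac{\left(v^2 - \tfrac{1}{4}\right)\left(v^2 - \tfrac{9}{4}\right)\left(v^2 - \tfrac{25}{4}\right)}{6!}\,\frac{\Delta^6 y_{-3} + \Delta^6 y_{-2}}{2} + \cdots \]
\[ \qquad + \frac{\left(v^2 - \tfrac{1}{4}\right)\left(v^2 - \tfrac{9}{4}\right)\cdots\left[v^2 - (2n - 1)^2/4\right]}{(2n)!}\,\frac{\Delta^{2n} y_{-n} + \Delta^{2n} y_{-n+1}}{2} \]
\[ \qquad + \frac{v\left(v^2 - \tfrac{1}{4}\right)\left(v^2 - \tfrac{9}{4}\right)\cdots\left[v^2 - (2n - 1)^2/4\right]}{(2n + 1)!}\Delta^{2n+1} y_{-n}. \]

In formulas (IV) and (VI) there are 2n + 2 terms, and the polynomials
represented by them coincide with the given function at the 2n + 2 points

\[ u = -n,\ -n + 1,\ -n + 2,\ \ldots,\ -1,\ 0,\ 1,\ 2,\ \ldots,\ n,\ n + 1; \]
\[ v = -\frac{2n + 1}{2},\ -\frac{2n - 1}{2},\ \ldots,\ -\frac{3}{2},\ -\frac{1}{2},\ \frac{1}{2},\ \frac{3}{2},\ \ldots,\ \frac{2n - 1}{2},\ \frac{2n + 1}{2}; \]

or

\[ x = x_0 - nh,\ x_0 - (n - 1)h,\ \ldots,\ x_0 - h,\ x_0,\ x_0 + h,\ \ldots,\ x_0 + nh,\ x_0 + (n + 1)h. \]

The zero point for the v’s is $x_0 + h/2$, whereas for the u’s it is $x_0$.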

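The following table shows the path of Bessel’s formula across a diagonal
difference table. The quantities that occur in the formula are printed in
heavy type.

[Table 8: diagonal difference table with columns y, Δy, Δ²y, ..., Δ⁸y. The entries used by Bessel’s formula are y_0 and y_1, the odd-order differences on the horizontal line midway between y_0 and y_1, and the pairs of even-order differences immediately above and below that line.]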

We shall now apply Stirling’s and Bessel’s formulas to some numerical 
examples. 

Example 1. The following table gives the values of the probability
integral

\[ f(x) = \frac{2}{\sqrt{\pi}} \int_0^x e^{-x^2}\,dx \]

for certain equidistant values of x. Find the value of this integral when
x = 0.5437.

   x       f(x)         Δf       Δ²f     Δ³f
  0.52   0.5378987
                       85654
  0.53   0.5464641              -903
                       84751              -7
  0.54   0.5549392              -910
                       83841              -7
  0.55   0.5633233              -917
                       82924              -6
  0.56   0.5716157              -923
                       82001
  0.57   0.5798158

Solution. Here we take $x_0 = 0.54$ and $x = 0.5437$. Since $h = 0.01$, we
have

\[ u = \frac{x - x_0}{h} = \frac{0.5437 - 0.54}{0.01} = \frac{0.0037}{0.01} = 0.37. \]

(a) Using Stirling’s formula, (III), we have

\[ f(0.5437) = 0.5549392 + 0.37\left(\frac{0.0084751 + 0.0083841}{2}\right) + \frac{0.37^2}{2}(-0.0000910) + \frac{0.37\,(0.37^2 - 1)}{6}\left(\frac{-0.0000007 - 0.0000007}{2}\right) \]
\[ \quad = 0.5549392 + 0.00311895 - 0.00000623 + 0.00000004 \]
\[ \quad = 0.5580520. \]

(b) To find $f(0.5437)$ by Bessel’s formula it is more convenient to use
(VI). Here

\[ v = u - \tfrac{1}{2} = 0.37 - 0.50 = -0.13. \]

Substituting in (VI), we have

\[ f(0.5437) = \frac{0.5549392 + 0.5633233}{2} + (-0.13)(0.0083841) + \frac{0.0169 - 0.25}{2}\left(\frac{-0.0000910 - 0.0000917}{2}\right) + \frac{-0.13\,(0.0169 - 0.25)}{6}(-0.0000007) \]
\[ \quad = 0.55913125 - 0.00108993 + 0.00001065 \]
\[ \quad = 0.5580520. \]
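Both computations of Example 1 can be reproduced with a short sketch of formulas (III) and (VI), each truncated after fourth differences. The helper names `difference_table`, `stirling`, and `bessel` are hypothetical, the argument `c` is the index of $y_0$ in the list, and the tabular arguments 0.52 to 0.57 are inferred from the tabulated values of the probability integral (the error function).

```python
import math


def difference_table(ys):
    """table[k][i] is the k-th simple difference of the entry with index i."""
    table = [list(ys)]
    while len(table[-1]) > 1:
        prev = table[-1]
        table.append([b - a for a, b in zip(prev, prev[1:])])
    return table


def stirling(ys, c, u):
    """Stirling's formula (III) through fourth differences, centered on ys[c]."""
    d = difference_table(ys)
    return (d[0][c]
            + u * (d[1][c - 1] + d[1][c]) / 2
            + u**2 / 2 * d[2][c - 1]
            + u * (u**2 - 1) / 6 * (d[3][c - 2] + d[3][c - 1]) / 2
            + u**2 * (u**2 - 1) / 24 * d[4][c - 2])


def bessel(ys, c, u):
    """Bessel's formula (VI) through fourth differences, with v = u - 1/2."""
    d = difference_table(ys)
    v = u - 0.5
    q = v**2 - 0.25
    return ((d[0][c] + d[0][c + 1]) / 2
            + v * d[1][c]
            + q / 2 * (d[2][c - 1] + d[2][c]) / 2
            + v * q / 6 * d[3][c - 1]
            + q * (v**2 - 2.25) / 24 * (d[4][c - 2] + d[4][c - 1]) / 2)


# Tabular values for x = 0.52, 0.53, ..., 0.57; x_0 = 0.54 has index 2, and u = 0.37.
ys = [0.5378987, 0.5464641, 0.5549392, 0.5633233, 0.5716157, 0.5798158]
by_stirling = stirling(ys, 2, 0.37)
by_bessel = bessel(ys, 2, 0.37)
```

The two formulas use the same tabular values here and agree to well within the accuracy of the seven-place table.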

Example 2. The values of $e^{-x}$ for certain equidistant values of x are
given in the following table. Find the value of $e^{-x}$ when x = 1.7489.

   x        e^{-x}            Δ          Δ²       Δ³     Δ⁴
  1.72   0.1790661479
                          -17817379
  1.73   0.1772844100                  177285
                          -17640094             -1762
  1.74   0.1755204006                  175523             13
                          -17464571             -1749
  1.75   0.1737739435                  173774             22
                          -17290797             -1727
  1.76   0.1720448638                  172047             15
                          -17118750             -1712
  1.77   0.1703329888                  170335
                          -16948415
  1.78   0.1686381473


Solution.

(a) By Stirling’s formula.

Here we take $x = 1.7489$, $x_0 = 1.75$, $h = 0.01$.

Hence

\[ u = \frac{1.7489 - 1.75}{0.01} = \frac{-0.0011}{0.01} = -0.11. \]

Substituting in (III), we have

\[ f(1.7489) = 0.1737739435 - 0.11\left(\frac{-0.0017464571 - 0.0017290797}{2}\right) + \frac{0.0121}{2}(0.0000173774) \]
\[ \qquad - \frac{0.11\,(0.0121 - 1)}{6}\left(\frac{-0.0000001749 - 0.0000001727}{2}\right) + \frac{0.0121\,(0.0121 - 1)}{24}(0.0000000022) \]
\[ \quad = 0.1737739435 + 0.00019115452 + 0.00000010513 - 0.00000000315; \]

or $f(1.7489) = 0.1739652000$.

This value is correct to ten decimal places.


(b) By Bessel’s formula.

Since the value 1.7489 is nearer to the middle of the interval 1.74 to 1.75
than it is to the middle of the interval 1.75 to 1.76, we take $x_0 = 1.74$ so
as to make v as small as possible. Hence we have

\[ u = \frac{1.7489 - 1.74}{0.01} = 0.89, \qquad v = u - \tfrac{1}{2} = 0.89 - 0.50 = 0.39. \]

\[ \therefore\ f(1.7489) = \frac{0.1755204006 + 0.1737739435}{2} + 0.39\,(-0.0017464571) + \frac{0.39^2 - 0.25}{2}\left(\frac{0.0000175523 + 0.0000173774}{2}\right) \]
\[ \qquad + \frac{0.39\,(0.39^2 - 0.25)}{6}(-0.0000001749) + \frac{(0.39^2 - 0.25)(0.39^2 - 2.25)}{24}\left(\frac{0.0000000013 + 0.0000000022}{2}\right) \]
\[ \quad = 0.17464717205 - 0.00068111827 - 0.00000085490 + 0.00000000111 + 0.00000000001; \]

or $f(1.7489) = 0.1739652000$, as before.

We could also take $x_0 = 1.75$, in which case we should have $v = -0.61$.
This would give

\[ f(1.7489) = 0.17290940365 + 0.00105473862 + 0.00000105562 + 0.00000000214 - 0.00000000002 = 0.1739652000. \]

This value is also correct to ten decimal places, but the series converges 
slightly less rapidly than in the preceding case; and both of these series 
given by Bessel’s formula converge a little less rapidly than the one given 
by Stirling’s formula. 

Remark. The question naturally arises at this point as to which is the
more accurate, Stirling’s formula or Bessel’s. The answer is that one is
about as accurate as the other. For a given table of differences the rapidity
of convergence depends upon the magnitude of u in the case of formula
(III) and upon the magnitude of v in the case of formula (VI). The
smaller the values of u and v the more rapidly the series converge. We
should therefore always choose the starting point $x_0$ so as to make u and v
as small as possible. In most cases it is possible to choose the starting point
so as to make $-0.5 \le u \le 0.5$ and $-0.5 \le v \le 0.5$. Thus, in Example 1
the starting point was so chosen that $u = 0.37$, $v = -0.13$; and in
Example 2 we had $u = -0.11$, $v = 0.39$. It is to be noted that Bessel’s
formula converged the more rapidly in the first example and Stirling’s
the more rapidly in the second, the reason being that v was smaller than
u in the first case and u smaller than v in the second.

As a general rule it may be stated that Bessel’s formula will give a more
accurate result when interpolating near the middle of an interval, say
from u = 0.25 to 0.75 (v = -0.25 to 0.25); whereas Stirling’s formula
will give the better result when interpolating near the beginning or end
of an interval, from u = -0.25 to 0.25, say.


For another phase of this question see Chapter VI. 




Example 3. The following table gives the values of the elliptic integral

\[ F(\phi) = \int_0^{\phi} \frac{d\phi}{\sqrt{1 - \tfrac12 \sin^2 \phi}} \]

for certain equidistant values of $\phi$. Find the value of $F(23^\circ.5)$.

  φ       F(φ)           ΔF         Δ²F      Δ³F     Δ⁴F

 21°   0.370634373
                      18070778
 22    0.388705151                 59002
                      18129780               2707
 23    0.406834931                 61709                4
                      18191489               2711
 24    0.425026420                 64420               -7
                      18255909               2704
 25    0.443282329                 67124
                      18323033
 26    0.461605362

(The differences are in units of the ninth decimal place.)

Solution. Since we are to find the value of the function halfway between two given tabular values, we use formula (V) for interpolating to halves. Hence we have

\[
\begin{aligned}
F(23^\circ.5) ={}& \frac{0.406834931 + 0.425026420}{2} - \frac18\cdot\frac{0.000061709 + 0.000064420}{2} + \frac{3}{128}\cdot\frac{0.000000004 - 0.000000007}{2} \\
={}& 0.4159306755 - 0.0000078831 = 0.415922792.
\end{aligned}
\]

This result is probably correct to its last figure, since the differences in the table are perfectly regular and decrease rapidly.
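The arithmetic of formula (V) in this example is short enough to restate as a sketch. The variable names are illustrative; the numbers are the tabular values and differences quoted in the solution.

```python
# Interpolation to halves, formula (V), for F(23.5 degrees).
F23, F24 = 0.406834931, 0.425026420       # F(23°), F(24°)
d2 = (0.000061709, 0.000064420)           # second differences on either side of the interval
d4 = (0.000000004, -0.000000007)          # fourth differences on either side of the interval

mean = (F23 + F24) / 2                    # mean of the two tabular values
second = -(d2[0] + d2[1]) / 2 / 8         # -(1/8) times the mean second difference
fourth = 3 * (d4[0] + d4[1]) / 2 / 128    # +(3/128) times the mean fourth difference
F_half = mean + second + fourth
print(f"{F_half:.9f}")
```

The fourth-difference term changes the result by only a few units in the eleventh decimal place, which is why the text drops it.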


EXERCISES IV

1. Find $\log_{10} \tan 55'\,43''.5$ by Bessel's formula (IV) or (VI), given

log tan 52' = 8.1797626 - 10
log tan 53' = 8.1880364 - 10
log tan 54' = 8.1961556 - 10
log tan 55' = 8.2041259 - 10
log tan 56' = 8.2119526 - 10
log tan 57' = 8.2196408 - 10
log tan 58' = 8.2271953 - 10
log tan 59' = 8.2346208 - 10.


2. Find cos 0.806595 by Stirling's formula, given

cos 0.8050 = 0.693111235
cos 0.8055 = 0.692750733
cos 0.8060 = 0.692390058
cos 0.8065 = 0.692029210
cos 0.8070 = 0.691668188
cos 0.8075 = 0.691306994
cos 0.8080 = 0.690945627.


3. Compute the value of $\dfrac{2}{\sqrt{\pi}} \displaystyle\int_0^x e^{-x^2}\,dx$ when $x = 0.6538$, given the following table:

  x      (2/√π) ∫₀ˣ e^(-x²) dx

 0.62         0.6194114
 0.63         0.6270463
 0.64         0.6345857
 0.65         0.6420292
 0.66         0.6493765
 0.67         0.6566275
 0.68         0.6637820


4. The mean atmospheric refraction, R, for a star at various altitudes h° above the horizon is given in the table below. Using Bessel's formula for interpolation to halves, find the refraction for a star at an altitude of 27° above the horizon.

  h         R

 22°     2' 23".3
 24      2  10 .2
 26      1  58 .9
 28      1  49 .2
 30      1  40 .6
 32      1  33 .0




5. The declination of the moon at the beginning (noon) of certain days in August, 1918, was as given below. Compute the declination for 9:36 P.M., August 25.

Aug. 20    -16°   0'  51".0
Aug. 21    -11   24   51 .8
Aug. 22    - 6    3   29 .4
Aug. 23    - 0   17   25 .8
Aug. 24    + 5   30   21 .5
Aug. 25     10   56   40 .3
Aug. 26     15   39   57 .8
Aug. 27     19   22    3 .7
Aug. 28     21   49   48 .3
Aug. 29     22   56   22 .8
Aug. 30     22   41   54 .1


6. The values of an elliptic integral for certain values of the amplitude $\phi$ are given in the table below. Compute the value of the integral when $\phi = 24^\circ\,36'\,42''$.

  φ       F(φ)

 21°   0.370634373
 22    0.388705151
 23    0.406834931
 24    0.425026420
 25    0.443282329
 26    0.461605362
 27    0.479998225







CHAPTER V 


INVERSE INTERPOLATION


32. Definition. Inverse interpolation is the process of finding the value 
of the argument corresponding to a given value of the function when the 
latter is intermediate between two tabulated values. The problem of 
inverse interpolation can be solved by several methods, but in this book 
we shall explain only three. 

33. By Lagrange’s Formula. One method of dealing with the problem 
is to use Lagrange’s interpolation formula in the form (27.3), in which 
x is expressed as a function of y. Example 2 of Article 27 was really a
problem in inverse interpolation. We shall therefore not explain this 
method further. 


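34. By Successive Approximations. A second method is that of successive approximations or iteration. To see how this method is applied let us consider Newton's formula (I), namely,

\[ y = y_0 + u\,\Delta y_0 + \frac{u(u-1)}{2!}\Delta^2 y_0 + \frac{u(u-1)(u-2)}{3!}\Delta^3 y_0 + \frac{u(u-1)(u-2)(u-3)}{4!}\Delta^4 y_0 + \cdots. \]

Transposing and dividing through by $\Delta y_0$, we have

\[ (1)\qquad u = \frac{y - y_0}{\Delta y_0} - \frac{u(u-1)}{2}\frac{\Delta^2 y_0}{\Delta y_0} - \frac{u(u-1)(u-2)}{3!}\frac{\Delta^3 y_0}{\Delta y_0} - \frac{u(u-1)(u-2)(u-3)}{4!}\frac{\Delta^4 y_0}{\Delta y_0} - \cdots. \]

To get a first approximation for $u$, we neglect all differences higher than the first and therefore have

\[ (2)\qquad u^{(1)} = \frac{y - y_0}{\Delta y_0}. \]

The second approximation is obtained by substituting $u^{(1)}$ in the right-hand side of (1). We then have

\[ (3)\qquad u^{(2)} = \frac{y - y_0}{\Delta y_0} - \frac{u^{(1)}(u^{(1)}-1)}{2}\frac{\Delta^2 y_0}{\Delta y_0} - \frac{u^{(1)}(u^{(1)}-1)(u^{(1)}-2)}{3!}\frac{\Delta^3 y_0}{\Delta y_0} - \frac{u^{(1)}(u^{(1)}-1)(u^{(1)}-2)(u^{(1)}-3)}{4!}\frac{\Delta^4 y_0}{\Delta y_0}. \]

The third approximation is

\[ u^{(3)} = \frac{y - y_0}{\Delta y_0} - \frac{u^{(2)}(u^{(2)}-1)}{2}\frac{\Delta^2 y_0}{\Delta y_0} - \frac{u^{(2)}(u^{(2)}-1)(u^{(2)}-2)}{3!}\frac{\Delta^3 y_0}{\Delta y_0} - \frac{u^{(2)}(u^{(2)}-1)(u^{(2)}-2)(u^{(2)}-3)}{4!}\frac{\Delta^4 y_0}{\Delta y_0}. \]

And so on for higher approximations.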

We shall now illustrate the method by working an example.

Example 1. Given a table of values of the probability integral $\dfrac{2}{\sqrt{\pi}} \displaystyle\int_0^x e^{-x^2}\,dx$, for what value of $x$ is this integral equal to $\tfrac12$?

  x        y          Δy       Δ²y     Δ³y    Δ⁴y

 0.45   0.4754818
                     91737
 0.46   0.4846555             -840
                     90897             -11
 0.47   0.4937452             -851              1
                     90046             -10
 0.48   0.5027498             -861              2
                     89185              -8
 0.49   0.5116683             -869
                     88316
 0.50   0.5204999

Solution. Here it is better to use a central-difference formula. Inspection shows that the desired value of $x$ lies between 0.47 and 0.48, and a rough linear interpolation shows that it is about 0.475. Hence we take $x_0 = 0.47$ and use Bessel's formula. We therefore have

\[ x_0 = 0.47, \qquad h = 0.01, \qquad y = \tfrac12 = 0.5. \]

Substituting in Bessel's formula (VI) this value of $y$ and the appropriate quantities from the table, we have

\[ 0.5 = 0.4982475 + 0.0090046\,v + \frac{v^2 - 0.25}{2}(-0.0000856) + \frac{v(v^2 - 0.25)}{6}(-0.0000010). \]

Transposing and dividing through by 0.0090046, we get

\[ (4)\qquad v = 0.194623 - (v^2 - 0.25)(-0.004753) - v(v^2 - 0.25)(-0.0000185). \]

A first approximation for $v$ is obtained by neglecting all terms beyond the first in the right-hand member of (4). Hence

\[ v^{(1)} = 0.194623. \]

Substituting this for $v$ in the right-hand member of (4), we find the second approximation to be

\[
\begin{aligned}
v^{(2)} ={}& 0.194623 - [(0.194623)^2 - 0.25](-0.004753) - 0.194623\,[(0.194623)^2 - 0.25](-0.0000185) \\
={}& 0.194623 - 0.001008 - 0.000001 = 0.193614.
\end{aligned}
\]

Now substituting this value for $v$ in the right-hand member of (4), we find

\[ v^{(3)} = 0.194623 - 0.0010101 - 0.000001 = 0.193612. \]

This value differs only slightly from the preceding, and we therefore make no further approximations.

Since $u = v + \tfrac12$ and $x = x_0 + hu$, we have

\[ u = 0.693612, \qquad x = 0.47 + 0.01(0.693612) = 0.47693612. \]

This value is correct to six decimal places.
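The iteration on equation (4) is easily mechanized. The sketch below uses the coefficients of (4) as derived above ($0.194623$, $0.004753$, $0.0000185$); the stopping tolerance of $5 \times 10^{-7}$ and the name `rhs` are assumptions made for illustration.

```python
def rhs(v):
    """Right-hand member of equation (4)."""
    q = v * v - 0.25
    return 0.194623 - q * (-0.004753) - v * q * (-0.0000185)

v = 0.194623                       # first approximation v(1)
for _ in range(10):                # a handful of passes is ample here
    v_next = rhs(v)
    converged = abs(v_next - v) < 5e-7   # successive values agree to six decimals
    v = v_next
    if converged:
        break

u = v + 0.5
x = 0.47 + 0.01 * u
print(round(v, 6), round(x, 8))
```

Because the correction terms carry the small factors 0.004753 and 0.0000185, each pass reduces the error by a factor of several hundred, so the loop stops after three or four evaluations.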

Note. In this example it is not possible to obtain more than five trustworthy figures in the value of $v$, because the right-hand member of (4) is the result of a division by the approximate number 0.0090046, the fifth significant figure of which is uncertain. As a matter of fact, only the first four figures in $v$ are correct.

If all differences higher than the second are negligible, the problem of 
inverse interpolation amounts only to the solution of a quadratic equation. 
The following example illustrates this. 





Example 2. Given sinh $x$ = 62, to find $x$.

Solution. Forming a difference table as shown below, we find that all differences above the second are zero. We also notice that the required value of $x$ is slightly greater than 4.82. Hence we take $x_0 = 4.82$ and use Stirling's formula.

  x      y = sinh x     Δy      Δ²y    Δ³y

 4.80     60.7511
                       6106
 4.81     61.3617               62
                       6168              0
 4.82     61.9785               62
                       6230              0
 4.83     62.6015               62
                       6292
 4.84     63.2307

Substituting $y = 62$ in Stirling's formula (III), we have

\[ 62 = 61.9785 + 0.6199\,u + 0.0031\,u^2, \]

or

\[ 31u^2 + 6199u = 215. \]

\[ \therefore\ u = \frac{-6199 + \sqrt{(6199)^2 + 4 \times 31 \times 215}}{62} = \frac{-6199 + 6201.15}{62} = \frac{2.15}{62} = 0.0347. \]

Since $h = 0.01$ and $x = x_0 + hu$, we get

\[ x = 4.82 + 0.01(0.0347) = 4.8203. \]
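When differences above the second vanish, the whole computation reduces to one quadratic, as the following sketch shows. It uses the tabular quantities of this example; the comparison with `math.asinh` is an added check, not part of the book's solution.

```python
import math

y0, y = 61.9785, 62.0
mean_first = (0.6168 + 0.6230) / 2     # mean of the two first differences about x0
half_second = 0.0062 / 2               # half the central second difference

# Stirling's formula gives half_second*u**2 + mean_first*u - (y - y0) = 0.
a, b, c = half_second, mean_first, -(y - y0)
u = (-b + math.sqrt(b * b - 4 * a * c)) / (2 * a)   # the small positive root
x = 4.82 + 0.01 * u
print(round(u, 4), round(x, 4), round(math.asinh(62.0), 4))
```

Only the root lying between $-\tfrac12$ and $\tfrac12$ is meaningful here; the other root of the quadratic is far outside the tabulated range.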

35. By Reversion of Series. The most obvious method of solving the problem of inverse interpolation is by reversion of series; for all the interpolation formulas thus far developed are in the form of a power series, and any convergent power series can be reverted. Thus, the power series

\[ (1)\qquad y = a_0 + a_1 x + a_2 x^2 + a_3 x^3 + \cdots + a_n x^n + \cdots \]

when reverted becomes

\[ (2)\qquad x = \frac{y - a_0}{a_1} + c_1\left(\frac{y - a_0}{a_1}\right)^2 + c_2\left(\frac{y - a_0}{a_1}\right)^3 + c_3\left(\frac{y - a_0}{a_1}\right)^4 + \cdots, \]

where

\[ (3)\qquad c_1 = -\frac{a_2}{a_1}, \qquad c_2 = -\frac{a_3}{a_1} + 2\left(\frac{a_2}{a_1}\right)^2, \qquad c_3 = -\frac{a_4}{a_1} + 5\left(\frac{a_2}{a_1}\right)\left(\frac{a_3}{a_1}\right) - 5\left(\frac{a_2}{a_1}\right)^3, \ \cdots. \]

When reverting a series with numerical coefficients, it is better to compute the c's from equations (3) and then substitute their values in (2).

We shall now write Newton's, Stirling's, and Bessel's formulas in the form of power series and then write down the values of $a_0, a_1, \cdots, a_4$ in each case. We stop with fourth differences, but the reader will have no difficulty in extending them to higher differences if necessary.

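(a) Newton's Formula (I).

\[
\begin{aligned}
y ={}& y_0 + u\,\Delta y_0 + \frac{u(u-1)}{2}\Delta^2 y_0 + \frac{u(u-1)(u-2)}{3!}\Delta^3 y_0 + \frac{u(u-1)(u-2)(u-3)}{4!}\Delta^4 y_0 \\
={}& y_0 + \left(\Delta y_0 - \frac{\Delta^2 y_0}{2} + \frac{\Delta^3 y_0}{3} - \frac{\Delta^4 y_0}{4}\right)u + \left(\frac{\Delta^2 y_0}{2} - \frac{\Delta^3 y_0}{2} + \frac{11\,\Delta^4 y_0}{24}\right)u^2 \\
& + \left(\frac{\Delta^3 y_0}{6} - \frac{\Delta^4 y_0}{4}\right)u^3 + \frac{\Delta^4 y_0}{24}\,u^4.
\end{aligned}
\]

Here

\[
\begin{aligned}
a_0 &= y_0, \\
a_1 &= \Delta y_0 - \frac{\Delta^2 y_0}{2} + \frac{\Delta^3 y_0}{3} - \frac{\Delta^4 y_0}{4}, \\
a_2 &= \frac{\Delta^2 y_0}{2} - \frac{\Delta^3 y_0}{2} + \frac{11\,\Delta^4 y_0}{24}, \\
a_3 &= \frac{\Delta^3 y_0}{6} - \frac{\Delta^4 y_0}{4}, \\
a_4 &= \frac{\Delta^4 y_0}{24}.
\end{aligned}
\]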
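The rearrangement of Newton's formula (I) into powers of $u$ can be verified with exact rational arithmetic. In the sketch below the leading differences are hypothetical numbers chosen only to exercise the identity, and the function names are illustrative.

```python
from fractions import Fraction as Fr

def newton_power_coeffs(y0, d1, d2, d3, d4):
    """Coefficients a0..a4 of Newton's formula (I) arranged in powers of u."""
    return [
        y0,
        d1 - d2 / 2 + d3 / 3 - d4 / 4,
        d2 / 2 - d3 / 2 + 11 * d4 / 24,
        d3 / 6 - d4 / 4,
        d4 / 24,
    ]

def newton_forward(y0, d1, d2, d3, d4, u):
    """Newton's formula (I) in its usual factorial form, through fourth differences."""
    return (y0 + u * d1 + u * (u - 1) / 2 * d2
            + u * (u - 1) * (u - 2) / 6 * d3
            + u * (u - 1) * (u - 2) * (u - 3) / 24 * d4)

# Hypothetical leading differences y0, Δy0, Δ²y0, Δ³y0, Δ⁴y0 (not from any table in the text).
diffs = [Fr(5), Fr(3), Fr(-2), Fr(7), Fr(1)]
a = newton_power_coeffs(*diffs)
for u in (Fr(1, 3), Fr(5, 2), Fr(-4)):
    power_form = sum(c * u**k for k, c in enumerate(a))
    assert power_form == newton_forward(*diffs, u)
print([str(c) for c in a])
```

Using `Fraction` avoids any rounding, so the two forms must agree exactly if the coefficients are right.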



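(b) Stirling's Formula (III).

\[
y = y_0 + u\,\frac{\Delta y_{-1} + \Delta y_0}{2} + \frac{u^2}{2}\Delta^2 y_{-1} + \frac{u(u^2 - 1)}{6}\cdot\frac{\Delta^3 y_{-2} + \Delta^3 y_{-1}}{2} + \frac{u^2(u^2 - 1)}{24}\Delta^4 y_{-2}.
\]

Here

\[
\begin{aligned}
a_0 &= y_0, \\
a_1 &= \tfrac12(\Delta y_{-1} + \Delta y_0) - \tfrac1{12}(\Delta^3 y_{-2} + \Delta^3 y_{-1}), \\
a_2 &= \tfrac12\Delta^2 y_{-1} - \tfrac1{24}\Delta^4 y_{-2}, \\
a_3 &= \tfrac1{12}(\Delta^3 y_{-2} + \Delta^3 y_{-1}), \\
a_4 &= \tfrac1{24}\Delta^4 y_{-2}.
\end{aligned}
\]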


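(c) Bessel's Formula (VI).

\[
\begin{aligned}
y ={}& \frac{y_0 + y_1}{2} + v\,\Delta y_0 + \frac{v^2 - \frac14}{2}\cdot\frac{\Delta^2 y_{-1} + \Delta^2 y_0}{2} + \frac{v(v^2 - \frac14)}{6}\Delta^3 y_{-1} + \frac{(v^2 - \frac14)(v^2 - \frac94)}{24}\cdot\frac{\Delta^4 y_{-2} + \Delta^4 y_{-1}}{2} \\
={}& \tfrac12(y_0 + y_1) - \tfrac1{16}(\Delta^2 y_{-1} + \Delta^2 y_0) + \tfrac3{256}(\Delta^4 y_{-2} + \Delta^4 y_{-1}) \\
& + \left(\Delta y_0 - \tfrac1{24}\Delta^3 y_{-1}\right)v + \left[\tfrac14(\Delta^2 y_{-1} + \Delta^2 y_0) - \tfrac5{96}(\Delta^4 y_{-2} + \Delta^4 y_{-1})\right]v^2 \\
& + \tfrac16\Delta^3 y_{-1}\,v^3 + \tfrac1{48}(\Delta^4 y_{-2} + \Delta^4 y_{-1})\,v^4.
\end{aligned}
\]

Here we have

\[
\begin{aligned}
a_0 &= \tfrac12(y_0 + y_1) - \tfrac1{16}(\Delta^2 y_{-1} + \Delta^2 y_0) + \tfrac3{256}(\Delta^4 y_{-2} + \Delta^4 y_{-1}), \\
a_1 &= \Delta y_0 - \tfrac1{24}\Delta^3 y_{-1}, \\
a_2 &= \tfrac14(\Delta^2 y_{-1} + \Delta^2 y_0) - \tfrac5{96}(\Delta^4 y_{-2} + \Delta^4 y_{-1}), \\
a_3 &= \tfrac16\Delta^3 y_{-1}, \\
a_4 &= \tfrac1{48}(\Delta^4 y_{-2} + \Delta^4 y_{-1}).
\end{aligned}
\]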

We shall now work Examples 1 and 2 of the preceding article by reverting the series. For Example 1 we use Bessel's formula as before. From the table of Example 1 we get

\[ \tfrac12(y_0 + y_1) = 0.4982475, \qquad \tfrac12(\Delta^2 y_{-1} + \Delta^2 y_0) = -0.0000856, \qquad \tfrac12(\Delta^4 y_{-2} + \Delta^4 y_{-1}) = 0.00000015. \]

Hence

\[
\begin{aligned}
a_0 &= 0.4982475 + \frac{0.0000856}{8} = 0.4982582, \\
a_1 &= 0.0090046 + \frac{0.0000010}{24} = 0.00900464, \\
a_2 &= -\frac{0.0000856}{2} = -0.0000428, \\
a_3 &= -\frac{0.0000010}{6} = -0.00000017, \\
a_4 &= 0, \text{ practically.}
\end{aligned}
\]

Since $y = \tfrac12 = 0.5$, we have

\[ \frac{y - a_0}{a_1} = \frac{0.5 - 0.4982582}{0.00900464} = 0.1934336. \]

Also

\[
\begin{aligned}
\frac{a_2}{a_1} &= \frac{-0.0000428}{0.00900464} = -0.004753, \\
\left(\frac{a_2}{a_1}\right)^2 &= (-0.004753)^2 = 0.0000225910, \\
\left(\frac{a_2}{a_1}\right)^3 &= -0.0000001074, \\
\frac{a_3}{a_1} &= \frac{-0.00000017}{0.00900464} = -0.00001888.
\end{aligned}
\]

Hence

\[
\begin{aligned}
c_1 &= 0.004753, \\
c_2 &= 0.00001888 + 2(0.000022591) = 0.00006406, \\
c_3 &= 0 + 5(-0.004753)(-0.00001888) - 5(-0.0000001074) = 0.000000986.
\end{aligned}
\]

Substituting these quantities in (2), we get

\[
\begin{aligned}
v &= 0.1934336 + 0.004753(0.1934336)^2 + 0.00006406(0.1934336)^3 \\
&= 0.1934336 + 0.0001778 + 0.00000046 \\
&= 0.193612.
\end{aligned}
\]

Hence

\[ u = v + \tfrac12 = 0.693612 \]

and

\[ x = x_0 + hu = 0.47 + 0.01(0.693612) = 0.47693612, \]

which is the same value as found by the method of successive approximations.
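The reversion can be sketched as a small routine. It assumes the first three reversion coefficients in the form the worked numbers above imply, namely $c_1 = -a_2/a_1$, $c_2 = -a_3/a_1 + 2(a_2/a_1)^2$, $c_3 = -a_4/a_1 + 5(a_2/a_1)(a_3/a_1) - 5(a_2/a_1)^3$; the function name `revert` is illustrative. The second call uses the Stirling coefficients of Example 2 ($a_0 = 61.9785$, $a_1 = 0.6199$, $a_2 = 0.0031$).

```python
def revert(a, y):
    """Solve y = a[0] + a[1]*x + ... + a[4]*x**4 for small x by series reversion through c3."""
    t = (y - a[0]) / a[1]
    r2, r3, r4 = a[2] / a[1], a[3] / a[1], a[4] / a[1]
    c1 = -r2
    c2 = -r3 + 2 * r2**2
    c3 = -r4 + 5 * r2 * r3 - 5 * r2**3
    return t + c1 * t**2 + c2 * t**3 + c3 * t**4

# Example 1: Bessel coefficients as computed in the text (a4 taken as zero).
v = revert([0.4982582, 0.00900464, -0.0000428, -0.00000017, 0.0], 0.5)
x1 = 0.47 + 0.01 * (v + 0.5)

# Example 2: Stirling coefficients; differences above the second vanish.
u = revert([61.9785, 0.6199, 0.0031, 0.0, 0.0], 62.0)
x2 = 4.82 + 0.01 * u
print(round(v, 6), round(x1, 6), round(x2, 4))
```

Unlike the iteration of Art. 34, the reverted series is evaluated once, which is the advantage the text points to when many digits must be carried.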


To solve Example 2 we use Stirling's formula, as before. Here

\[ a_0 = y_0 = 61.9785, \qquad a_1 = 0.6199, \qquad a_2 = \frac{0.0062}{2} = 0.0031, \qquad a_3 = a_4 = 0. \]

Since $y = 62$, we have

\[ y - a_0 = 62 - 61.9785 = 0.0215. \]

Hence

\[ \frac{y - a_0}{a_1} = \frac{0.0215}{0.6199} = 0.034683, \qquad \frac{a_2}{a_1} = \frac{0.0031}{0.6199} = 0.005001. \]

\[ \therefore\ c_1 = -0.005001, \qquad c_2 = 2(0.005001)^2 = 0.00005002, \qquad c_3 = 0, \text{ practically.} \]

Substituting these values in (2), we have

\[ u = 0.034683 - 0.005001(0.034683)^2 = 0.0347. \]

\[ \therefore\ x = 4.82 + 0.01(0.0347) = 4.8203, \]

as previously found by the method of iteration.





Remark. The problem of inverse interpolation should be dealt with in 
practice by the iteration process when only a few digits are to be substituted 
in the right-hand member, and by reversion of series when the number of 
digits involved is large. 


EXERCISES V

If cosh $x$ = 1.285, find $x$ by inverse interpolation, using the data in the following table:

   x        cosh x

 0.735    1.2824937
 0.736    1.2832974
 0.737    1.2841023
 0.738    1.2849085
 0.739    1.2857159
 0.740    1.2865247
 0.741    1.2873348
 0.742    1.2881461



CHAPTER VI


THE ACCURACY OF INTERPOLATION FORMULAS

36. Introduction. In the preceding articles we have dealt with polynomial formulas for representing a given function over an interval. These polynomials coincide with the given function at the points $(x_0, y_0)$, $(x_1, y_1)$, $(x_2, y_2)$, etc. Hence it is reasonable to suppose that we can make these polynomials approximate the given function as closely as desired by merely increasing the number of coinciding points. Such indeed is the case if we do not attempt to spread over too wide an interval, but the necessity for caution in this matter will appear from the following considerations.

When the number of points $x_0, x_1, \cdots, x_n$ increases indefinitely, the polynomial interpolation formulas become infinite series, called interpolation series; and just as a power series converges in a certain interval and diverges outside the interval, so likewise an interpolation series converges and represents the given function over a certain interval but fails to represent it outside of that interval. For example, if we should attempt to represent the function $1/(1 + x^2)$ over the interval $-5 \le x \le 5$ by an interpolation series, we should find that the series would not represent the function at all when $x = 4$. As a matter of fact, the series would converge and represent the function to any desired degree of accuracy between $x = -3.63$ and $x = +3.63$, but would diverge and fail to represent it outside of this interval.* The investigation of the convergence of interpolation series is a somewhat lengthy matter and requires the use of functions of a complex variable.† We shall therefore not enter into it, but merely derive expressions for the remainder terms in the polynomial formulas previously considered.
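The behavior described for $1/(1 + x^2)$ can be observed numerically. The sketch below interpolates at equidistant points on $-5 \le x \le 5$ and compares the largest error near the center with the largest error near the ends as the number of points is doubled. The plain Lagrange evaluation, the sampling grid, and the choice of 10 and 20 intervals are assumptions made for illustration; a finite experiment of this kind illustrates the divergence but does not establish the limit 3.63.

```python
def runge(x):
    return 1.0 / (1.0 + x * x)

def lagrange_eval(xs, ys, x):
    """Evaluate the interpolating polynomial through the points (xs, ys) at x."""
    total = 0.0
    for i, xi in enumerate(xs):
        term = ys[i]
        for j, xj in enumerate(xs):
            if j != i:
                term *= (x - xj) / (xi - xj)
        total += term
    return total

def max_error(n, lo, hi, samples=201):
    """Largest |f - P_n| sampled on [lo, hi], using n equal intervals spanning [-5, 5]."""
    xs = [-5.0 + 10.0 * k / n for k in range(n + 1)]
    ys = [runge(x) for x in xs]
    grid = [lo + (hi - lo) * k / (samples - 1) for k in range(samples)]
    return max(abs(runge(x) - lagrange_eval(xs, ys, x)) for x in grid)

for n in (10, 20):
    print(n, max_error(n, -1.0, 1.0), max_error(n, 3.7, 5.0))
```

With more points the error near the center shrinks while the error beyond about $x = 3.63$ grows, in agreement with the statement above.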

* Runge, "Über empirische Funktionen und die Interpolation zwischen äquidistanten Ordinaten," Zeitschrift für Math. und Physik, vol. XLVI, p. 228. See also Steffensen's Interpolation, pp. 35-38.

† The interested reader should consult the paper by Runge, cited above, and also the following Borel Monographs: Nörlund, Leçons sur les Séries d'Interpolation, Paris; Borel, Leçons sur les Fonctions de Variables Réelles et les Développements en Séries de Polynomes, Paris; Montel, Leçons sur les Séries de Polynomes à une Variable Complexe, Paris. Also Runge's Theorie und Praxis der Reihen, Leipzig.

37. Remainder Term in Newton's Formula (I) and in Lagrange's Formula. The derivation of the remainder term in a polynomial interpolation formula is very similar to that of finding the remainder in Taylor's expansion. Thus, to find the remainder term in Newton's formula (I) and in Lagrange's formula, we write down the arbitrary function

\[ (1)\qquad F(z) = f(z) - \phi(z) - [f(x) - \phi(x)]\,\frac{(z - x_0)(z - x_1)\cdots(z - x_n)}{(x - x_0)(x - x_1)\cdots(x - x_n)}, \]

where $f(x)$ denotes the given function, $\phi(x)$ a polynomial interpolation formula, and $z$ a real variable. We shall assume that $f(x)$ is continuous and possesses continuous derivatives of all orders within the interval from $x_0$ to $x_n$.

Now $F(z)$ vanishes for the $n + 2$ values $z = x, x_0, x_1, \cdots, x_n$; and since $f(x)$ is continuous and has continuous derivatives of all orders, the same is true of $f(z)$ and hence of $F(z)$. $F(z)$ therefore satisfies the conditions of Rolle's theorem. Hence the first derivative of $F(z)$ vanishes at least once between every two consecutive zero values of $F(z)$. Therefore in the interval from $x_0$ to $x_n$, $F'(z)$ must vanish $n + 1$ times; $F''(z)$, $n$ times; $F'''(z)$, $n - 1$ times; etc. Hence the $(n+1)$th derivative of $F(z)$ will vanish at least once at some point whose abscissa is $\xi$.

Since $\phi(z)$ is a polynomial of the $n$th degree, its $(n+1)$th derivative is zero. Furthermore, since the expression $(z - x_0)(z - x_1)(z - x_2)\cdots(z - x_n)$ is a polynomial of degree $n + 1$, it follows that its $(n+1)$th derivative is the same as the $(n+1)$th derivative of $z^{n+1}$, which is $(n+1)!$. On differentiating (1) $n + 1$ times with respect to $z$ we therefore have

\[ F^{(n+1)}(z) = f^{(n+1)}(z) - [f(x) - \phi(x)]\,\frac{(n+1)!}{(x - x_0)(x - x_1)\cdots(x - x_n)}. \]

But since $F^{(n+1)}(z) = 0$ at some point $z = \xi$, we have

\[ 0 = f^{(n+1)}(\xi) - [f(x) - \phi(x)]\,\frac{(n+1)!}{(x - x_0)(x - x_1)\cdots(x - x_n)}. \]

Hence

\[ f(x) - \phi(x) = \frac{f^{(n+1)}(\xi)}{(n+1)!}\,(x - x_0)(x - x_1)\cdots(x - x_n). \]

Now since $f(x) - \phi(x)$ is the difference between the given function and the polynomial at any point whose abscissa is $x$, it represents the error committed by replacing the given function by the polynomial. Hence we have

\[ (2)\qquad \text{Error} = R_n = \frac{f^{(n+1)}(\xi)}{(n+1)!}\,(x - x_0)(x - x_1)\cdots(x - x_n), \]

where $\xi$ is some value of $x$ between $x_0$ and $x_n$. This is the remainder term in formula (2) of Art. 20 and in Lagrange's formula (27.1).

To get the remainder term in formula (I) of Art. 20 we recall that

\[ x - x_0 = hu, \quad x - x_1 = h(u - 1), \quad x - x_2 = h(u - 2), \ \cdots, \ x - x_n = h(u - n). \]

Substituting these values of $x - x_0$, $x - x_1$, etc. in (2) above, we have

\[ (3)\qquad R_n = \frac{h^{n+1} f^{(n+1)}(\xi)}{(n+1)!}\,u(u - 1)(u - 2)\cdots(u - n). \]

If the analytical form of the given function $f(x)$ is unknown, then the best we can do is to replace $f^{(n+1)}(\xi)$ by its value in terms of differences. From Article 18 we have

\[ (a)\qquad \Delta^n f(x) = (\Delta x)^n f^{(n)}(x + \theta n\,\Delta x), \qquad 0 < \theta < 1. \]

Putting $x = x_0$ and $\Delta x = h$, we have from (a)

\[ (b)\qquad f^{(n)}(x_0 + \theta n h) = \frac{\Delta^n f(x_0)}{h^n}. \]

Now since $x_0 + \theta n h$ and $\xi$ are values of $x$ at points within the interval of interpolation (that is, between $x_0$ and $x_n$) we may, for practical purposes, put $\xi = x_0 + \theta n h$. Making this substitution in (b), we get

\[ (c)\qquad f^{(n)}(\xi) = \frac{\Delta^n f(x_0)}{h^n}. \]

Hence we have

\[ (d)\qquad f^{(n+1)}(\xi) = \frac{\Delta^{n+1} f(x_0)}{h^{n+1}}, \]

practically. Substituting this value of $f^{(n+1)}(\xi)$ in (3), we get

\[ (4)\qquad R_n = \frac{\Delta^{n+1} y_0}{(n+1)!}\,u(u - 1)(u - 2)\cdots(u - n). \]

The smaller the interval $h$ is taken the more nearly does (4) give the actual error.


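38. Remainder Term in Newton's Formula (II). To find a formula for the remainder in Newton's formula for backward interpolation we write down the function

\[ F(z) = f(z) - \phi(z) - [f(x) - \phi(x)]\,\frac{(z - x_n)(z - x_{n-1})\cdots(z - x_0)}{(x - x_n)(x - x_{n-1})\cdots(x - x_0)}, \]

differentiate it $n + 1$ times with respect to $z$, and put $F^{(n+1)}(z) = 0$ for $z = \xi$. We thus find

\[ f(x) - \phi(x) = \frac{f^{(n+1)}(\xi)}{(n+1)!}\,(x - x_n)(x - x_{n-1})\cdots(x - x_0), \]

or

\[ (1)\qquad \text{Error} = R_n = \frac{f^{(n+1)}(\xi)}{(n+1)!}\,(x - x_n)(x - x_{n-1})\cdots(x - x_0). \]

This is the remainder term for formula (2) of Art. 21.

To find the corresponding formula in terms of $u$ we recall that

\[ \frac{x - x_n}{h} = u, \]

so that

\[ x - x_n = hu, \quad x - x_{n-1} = h(u + 1), \ \cdots, \ x - x_0 = h(u + n). \]

Substituting these values for $x - x_n$ etc. in (1) above, we get

\[ (2)\qquad R_n = \frac{h^{n+1} f^{(n+1)}(\xi)}{(n+1)!}\,u(u + 1)(u + 2)\cdots(u + n). \]

To find a formula for $R_n$ when the analytical form of the given function is unknown, we replace $f^{(n+1)}(\xi)$ by $\Delta_{n+1} y_n / h^{n+1}$ in (2), where $\Delta_{n+1} y_n$ denotes the $(n+1)$th backward difference. The result is

\[ (3)\qquad R_n = \frac{\Delta_{n+1} y_n}{(n+1)!}\,u(u + 1)(u + 2)\cdots(u + n). \]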

39. Remainder Term in Stirling's Formula. We next turn our attention to the central-difference formulas of Stirling and Bessel. To find the remainder term in Stirling's formula we write down the arbitrary function

\[ (1)\qquad F(z) = f(z) - \phi(z) - [f(x) - \phi(x)]\,\frac{(z - x_0)(z - x_1)(z - x_{-1})\cdots(z - x_n)(z - x_{-n})}{(x - x_0)(x - x_1)(x - x_{-1})\cdots(x - x_n)(x - x_{-n})}. \]

This function vanishes for the $2n + 2$ values $z = x, x_0, x_1, \cdots, x_n, x_{-1}, x_{-2}, \cdots, x_{-n}$. We assume that $f(x)$ is continuous and has continuous derivatives of all orders up to $2n + 1$. Hence $F(z)$ satisfies the conditions of Rolle's theorem. Also, since $\phi(z)$ is a polynomial of degree $2n$, its $(2n+1)$th derivative is zero. Hence on differentiating (1) $2n + 1$ times and putting $F^{(2n+1)}(z) = 0$ for some value $z = \xi$, we get

\[ 0 = f^{(2n+1)}(\xi) - 0 - [f(x) - \phi(x)]\,\frac{(2n+1)!}{(x - x_0)(x - x_1)(x - x_{-1})\cdots(x - x_n)(x - x_{-n})}, \]

from which

\[ f(x) - \phi(x) = \frac{f^{(2n+1)}(\xi)}{(2n+1)!}\,(x - x_0)(x - x_1)(x - x_{-1})\cdots(x - x_n)(x - x_{-n}), \]

or

\[ (2)\qquad \text{Error} = R_n = \frac{f^{(2n+1)}(\xi)}{(2n+1)!}\,(x - x_0)(x - x_1)(x - x_{-1})\cdots(x - x_n)(x - x_{-n}). \]

We write this formula in terms of $u$ as follows: Since

\[ x - x_0 = hu, \quad x - x_1 = h(u - 1), \ \cdots, \ x - x_n = h(u - n), \]

and

\[ x - x_{-1} = x - (x_0 - h) = x - x_0 + h = hu + h = h(u + 1), \quad x - x_{-2} = h(u + 2), \ \cdots, \ x - x_{-n} = h(u + n), \]

we have

\[ (3)\qquad R_n = \frac{h^{2n+1} f^{(2n+1)}(\xi)}{(2n+1)!}\,u(u^2 - 1)(u^2 - 2^2)(u^2 - 3^2)\cdots(u^2 - n^2), \]

where $\xi$ is some value of $x$ between $x_{-n}$ and $x_n$.

If the analytical form of $f(x)$ is unknown, we replace $f^{(2n+1)}(\xi)$ by $\Delta_m^{2n+1}/h^{2n+1}$, where

\[ \Delta_m^{2n+1} = \frac{\Delta^{2n+1} y_{-n-1} + \Delta^{2n+1} y_{-n}}{2}. \]

Hence we get from (3)

\[ (4)\qquad R_n = \frac{\Delta_m^{2n+1}}{(2n+1)!}\,u(u^2 - 1)(u^2 - 2^2)\cdots(u^2 - n^2). \]

In formulas (3) and (4) $n$ is the number of intervals on each side of $x_0$.

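40. Remainder Terms in Bessel's Formulas. The remainder term in Bessel's formulas is derived by first writing down the arbitrary function

\[ (1)\qquad F(z) = f(z) - \phi(z) - [f(x) - \phi(x)]\,\frac{(z - x_0)(z - x_1)(z - x_{-1})\cdots(z - x_n)(z - x_{-n})(z - x_{n+1})}{(x - x_0)(x - x_1)(x - x_{-1})\cdots(x - x_n)(x - x_{-n})(x - x_{n+1})}. \]

This function vanishes at the $2n + 3$ points $z = x, x_0, x_1, x_{-1}, \cdots, x_n, x_{-n}, x_{n+1}$. Since $\phi(z)$ is a polynomial of degree $2n + 1$, its $(2n+2)$th derivative is zero. Hence on differentiating (1) $2n + 2$ times with respect to $z$ and putting $F^{(2n+2)}(z) = 0$ for some value $z = \xi$, we get

\[ 0 = f^{(2n+2)}(\xi) - [f(x) - \phi(x)]\,\frac{(2n+2)!}{(x - x_0)(x - x_1)(x - x_{-1})\cdots(x - x_n)(x - x_{-n})(x - x_{n+1})}, \]

from which

\[ f(x) - \phi(x) = \frac{f^{(2n+2)}(\xi)}{(2n+2)!}\,(x - x_0)(x - x_1)(x - x_{-1})\cdots(x - x_n)(x - x_{-n})(x - x_{n+1}), \]

or

\[ (2)\qquad \text{Error} = R_n = \frac{f^{(2n+2)}(\xi)}{(2n+2)!}\,(x - x_0)(x - x_1)(x - x_{-1})\cdots(x - x_n)(x - x_{-n})(x - x_{n+1}). \]

Putting $x - x_0 = hu$, $x - x_1 = h(u - 1)$, $x - x_{-1} = h(u + 1)$, etc., as in the case of Stirling's formula, we get

\[ (3)\qquad R_n = \frac{h^{2n+2} f^{(2n+2)}(\xi)}{(2n+2)!}\,u(u - 1)(u + 1)(u - 2)\cdots(u - n)(u + n)(u - n - 1). \]

This is the remainder term in formula (IV) of Art. 31. In terms of differences it becomes

\[ (4)\qquad R_n = \frac{\Delta_m^{2n+2}}{(2n+2)!}\,u(u - 1)(u + 1)(u - 2)(u + 2)\cdots(u - n)(u + n)(u - n - 1), \]

where

\[ \Delta_m^{2n+2} = \frac{\Delta^{2n+2} y_{-n-1} + \Delta^{2n+2} y_{-n}}{2}. \]

On putting $u = v + \tfrac12$ in (3) and (4), we get

\[ (5)\qquad R_n = \frac{h^{2n+2} f^{(2n+2)}(\xi)}{(2n+2)!}\left(v^2 - \frac14\right)\left(v^2 - \frac94\right)\cdots\left(v^2 - \frac{(2n+1)^2}{4}\right), \]

\[ (6)\qquad R_n = \frac{\Delta_m^{2n+2}}{(2n+2)!}\left(v^2 - \frac14\right)\left(v^2 - \frac94\right)\cdots\left(v^2 - \frac{(2n+1)^2}{4}\right). \]

These are the remainder terms in formula (VI) of Art. 31.

Putting $v = 0$ in (5) and (6), we get the remainder terms in the formula for interpolating to halves, namely

\[ (7)\qquad R_n = (-1)^{n+1}\,\frac{h^{2n+2} f^{(2n+2)}(\xi)}{(2n+2)!}\cdot\frac{[1 \cdot 3 \cdot 5 \cdots (2n+1)]^2}{2^{2n+2}}, \]

\[ (8)\qquad R_n = (-1)^{n+1}\,\frac{\Delta_m^{2n+2}}{(2n+2)!}\cdot\frac{[1 \cdot 3 \cdot 5 \cdots (2n+1)]^2}{2^{2n+2}}. \]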

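41. Recapitulation of Formulas for the Remainder. We now collect for easy reference the most important of the formulas derived in this chapter.

1. Newton's Formula (I)

\[ (a)\quad R_n = \frac{h^{n+1} f^{(n+1)}(\xi)}{(n+1)!}\,u(u - 1)(u - 2)\cdots(u - n). \]

\[ (b)\quad R_n = \frac{\Delta^{n+1} y_0}{(n+1)!}\,u(u - 1)(u - 2)\cdots(u - n). \]

2. Newton's Formula (II)

\[ (a)\quad R_n = \frac{h^{n+1} f^{(n+1)}(\xi)}{(n+1)!}\,u(u + 1)(u + 2)\cdots(u + n). \]

\[ (b)\quad R_n = \frac{\Delta_{n+1} y_n}{(n+1)!}\,u(u + 1)(u + 2)\cdots(u + n). \]

3. Stirling's Formula, (III)

\[ (a)\quad R_n = \frac{h^{2n+1} f^{(2n+1)}(\xi)}{(2n+1)!}\,u(u^2 - 1)(u^2 - 2^2)\cdots(u^2 - n^2). \]

\[ (b)\quad R_n = \frac{\Delta_m^{2n+1}}{(2n+1)!}\,u(u^2 - 1)(u^2 - 2^2)\cdots(u^2 - n^2). \]

4. Bessel's Formula in terms of u, (IV)

\[ (a)\quad R_n = \frac{h^{2n+2} f^{(2n+2)}(\xi)}{(2n+2)!}\,u(u - 1)(u + 1)(u - 2)\cdots(u - n)(u + n)(u - n - 1). \]

\[ (b)\quad R_n = \frac{\Delta_m^{2n+2}}{(2n+2)!}\,u(u - 1)(u + 1)(u - 2)\cdots(u - n)(u + n)(u - n - 1). \]

5. Bessel's Formula in terms of v, (VI)

\[ (a)\quad R_n = \frac{h^{2n+2} f^{(2n+2)}(\xi)}{(2n+2)!}\left(v^2 - \frac14\right)\left(v^2 - \frac94\right)\cdots\left(v^2 - \frac{(2n+1)^2}{4}\right). \]

\[ (b)\quad R_n = \frac{\Delta_m^{2n+2}}{(2n+2)!}\left(v^2 - \frac14\right)\left(v^2 - \frac94\right)\cdots\left(v^2 - \frac{(2n+1)^2}{4}\right). \]

6. Formula for Interpolating to Halves, (V)

\[ (a)\quad R_n = (-1)^{n+1}\,\frac{h^{2n+2} f^{(2n+2)}(\xi)}{(2n+2)!}\cdot\frac{[1 \cdot 3 \cdot 5 \cdots (2n+1)]^2}{2^{2n+2}}. \]

\[ (b)\quad R_n = (-1)^{n+1}\,\frac{\Delta_m^{2n+2}}{(2n+2)!}\cdot\frac{[1 \cdot 3 \cdot 5 \cdots (2n+1)]^2}{2^{2n+2}}. \]

7. Lagrange's Formula, (27.1)

\[ R_n = \frac{f^{(n+1)}(\xi)}{(n+1)!}\,(x - x_0)(x - x_1)\cdots(x - x_n). \]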

Where the formulas are given in pairs, the second form (b) should be used when the analytic form of the function is not known.

To lessen the labor of computing $R_n$ from these formulas the student should, when possible, use the expressions for the nth derivatives given on page 38.

It is not worth while to compute the remainder term in many applications of Newton's, Stirling's, and Bessel's formulas, because if the starting point is so chosen that $u$ and $v$ are numerically less than 1 and if the differences of some order are practically constant, the interpolated result will usually be correct to as many figures as are given in the tabular values of the function. This statement is based on the assumption that all available differences are used in the interpolation formula, or at least all differences which will contribute anything to the last figure retained. It is in those cases where the differences do not become constant or where it is impracticable to make use of differences above a certain order that we should compute the remainder term.

When using Lagrange's formula, however, the case is very different. Here there are no differences available and there is nothing in the formula itself by which we can estimate the reliability of the results obtained. We should therefore compute the remainder term in every application of this formula where it is possible to do so.

It is to be observed, however, that the inherent error in Lagrange's formula involves the $(n+1)$th derivative of the given function. When the analytical form of a function is not known, the inherent error due to the use of Lagrange's formula cannot be estimated.

The student should observe that the remainder term in Stirling's formula contains odd differences, whereas in Bessel's formula it contains even differences. If, therefore, when using a central-difference formula we stop with even differences and wish to estimate the error, we should use Stirling's formula; whereas if we stop with odd differences, we should use Bessel's formula. If this rule is followed, the remainder term will always be the next term after the one at which we stop.

There should never be any difficulty in determining the proper value of $n$ to be substituted in the remainder formulas. Thus, if we are using Bessel's formula and stop with third differences, the remainder term will contain fourth differences. Hence we must have $2n + 2 = 4$, or $n = 1$.

On the other hand, if we are using Stirling's formula and stop with fourth differences, the remainder term will contain fifth differences. Hence we shall then have $2n + 1 = 5$, from which $n = 2$.

We shall now compute the remainder term in an application of Bessel's formula.


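Example. The following table contains values of the function $y = x^4 + 10x^5$ for certain values of $x$. Find $y$ when $x = 2.27$.

  x         y           Δy          Δ²y        Δ³y       Δ⁴y

 2.0     336.0000
                      91.8582
 2.1     427.8582                 19.0724
                     110.9306                 2.8266
 2.2     538.7888                 21.8990                0.2664
                     132.8296                 3.0930
 2.3     671.6184                 24.9920                0.2784
                     157.8216                 3.3714
 2.4     829.4400                 28.3634
                     186.1850
 2.5    1015.6250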












Solution. Since we wish to use Bessel's formula and compute the remainder term, we stop with third differences. Taking $x_0 = 2.2$, $x = 2.27$, $h = 0.1$, we have

\[ u = \frac{2.27 - 2.20}{0.1} = \frac{0.07}{0.1} = 0.7, \]

or

\[ v = u - \tfrac12 = 0.2. \]

\[
\begin{aligned}
\therefore\ y ={}& \frac{538.7888 + 671.6184}{2} + 0.2\,(132.8296) + \frac{0.04 - 0.25}{2}\cdot\frac{21.8990 + 24.9920}{2} + 0.2\,\frac{0.04 - 0.25}{6}\,(3.0930) \\
={}& 605.2036 + 26.56592 - 2.46178 - 0.02165,
\end{aligned}
\]

\[ y = 629.28609. \]

To find $R_n$ we have $2n + 2 = 4$, or $n = 1$. Also

\[ f^{\mathrm{iv}}(x) = 24 + 1200x. \]

Hence

\[ f^{\mathrm{iv}}(\xi) = 24 + 1200\xi. \]

Now since $\xi$ lies somewhere between 2.0 and 2.5, we can express it in the form

\[ \xi = 2.25 + 0.1\eta, \]

where $\eta$ lies between $-2.5$ and $+2.5$. Substituting this value of $\xi$ in $f^{\mathrm{iv}}(\xi)$ above, we get

\[ f^{\mathrm{iv}}(\xi) = f^{\mathrm{iv}}(2.25 + 0.1\eta) = 24 + 2700 + 120\eta = 2724 + 120\eta. \]

Hence by (5) of Art. 40 we have

\[
\begin{aligned}
R_n &= \frac{(0.1)^4 (2724 + 120\eta)}{4!}\,(0.04 - 0.25)(0.04 - 2.25) \\
&= 0.00527 + 0.000232\eta \\
&= 0.00527 \pm 0.00058.
\end{aligned}
\]

We therefore have

\[ y = 629.28609 + 0.00527 \pm 0.00058 = 629.29136 \pm 0.00058. \]

The value of $y$ is thus between 629.2919 and 629.2908, or between 629.292 and 629.291. The correct value to four decimal places is 629.2914, and this happens to be the mean of the two limits found above.

If we substitute differences instead of the derivative in $R_n$, we have $\Delta_m^4 = (0.2664 + 0.2784)/2 = 0.2724$; and therefore by (6) of Art. 40

\[ R_n = \frac{0.2724}{4!}\,(0.04 - 0.25)(0.04 - 2.25) = 0.00527, \]

which is the definite part of the remainder term found by using the derivative. We then have $y = 629.28609 + 0.00527 = 629.29136$, which is correct to four decimal places.
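The whole example can be replayed numerically as a check on both forms of the remainder. The sketch below assumes the function $y = x^4 + 10x^5$ tabulated at $x = 2.0, 2.1, \dots, 2.5$, which reproduces every tabular value and difference quoted in the solution; the variable names are illustrative.

```python
def f(x):
    return x**4 + 10 * x**5

h, x0, x = 0.1, 2.2, 2.27
ys = [f(2.0 + h * k) for k in range(6)]            # table values at 2.0 ... 2.5
d1 = [b - a for a, b in zip(ys, ys[1:])]           # first differences
d2 = [b - a for a, b in zip(d1, d1[1:])]           # second differences
d3 = [b - a for a, b in zip(d2, d2[1:])]           # third differences
d4 = [b - a for a, b in zip(d3, d3[1:])]           # fourth differences

v = (x - x0) / h - 0.5                             # v = 0.2
q = v * v - 0.25
# Bessel's formula (VI) stopped at third differences, with y0 = ys[2].
y = ((ys[2] + ys[3]) / 2 + v * d1[2]
     + q / 2 * (d2[1] + d2[2]) / 2
     + v * q / 6 * d3[1])

product = q * (v * v - 2.25)                       # (v^2 - 1/4)(v^2 - 9/4)
# Derivative form (5): the fourth derivative is 24 + 1200*xi, xi between 2.0 and 2.5.
r_low = h**4 * (24 + 1200 * 2.0) / 24 * product
r_high = h**4 * (24 + 1200 * 2.5) / 24 * product
# Difference form (6): mean of the two fourth differences.
r_diff = (d4[0] + d4[1]) / 2 / 24 * product

print(round(y, 5), round(r_low, 5), round(r_high, 5), round(r_diff, 5))
print(round(f(x) - y, 5))                          # the actual error of the interpolation
```

The actual error falls between the two limits obtained from the derivative, and the difference form lands close to it, as the text observes.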


Note. The substitution $\xi = x_m + h\eta$, where $x_m$ denotes the midpoint of the range of given values of the function, gives the remainder as the sum of two terms, the larger of which is perfectly definite and unaffected by the uncertain factor $\eta$. It also saves the trouble of finding the greatest and least values of $f^{(n)}(x)$ in order to find the limits between which the true value of the computed function lies. For Newton's formulas (I) and (II) we make the substitutions $\xi = x_0 + h\eta$ and $\xi = x_n - h\eta$, respectively, where $\eta$ is now positive in each case. For computing $R_n$ in Lagrange's formula we should put $\xi = x_m + h\eta$, as in the example worked above.

A final remark concerning accuracy must now be made. When the analytical form of a function is totally unknown, and the sum total of our knowledge of the function consists merely of a set of tabular values of the argument, the problem of interpolation is really indeterminate; for it is theoretically possible to construct a large number of functions which would take the values $y_0, y_1, y_2, \cdots, y_n$ corresponding to the values $x_0, x_1, x_2, \cdots, x_n$ of the argument. Nevertheless, if we have some knowledge of the nature of the function with which we are dealing and have no reason to believe that it behaves in an erratic manner within the range of values considered, we may fairly assume that its graph is a smooth curve, in which case the function can safely be replaced by a polynomial.





42. The Accuracy of Linear Interpolation from Tables. We shall now derive a simple formula for the maximum error inherent in linear interpolation from tables.

In the remainder after $n + 1$ terms in Newton's formula (I) let us put $n = 1$. Then $R_n$ becomes

\[ (1)\qquad R_1 = \frac{h^2 f''(\xi)}{2}\,u(u - 1), \qquad |R_1| \le \frac{h^2 M}{2}\,|u^2 - u|, \]

where $M$ denotes the maximum absolute value of $f''(x)$ in any interval of width $h$. To find the maximum numerical value of $R_1$ we differentiate it with respect to $u$, put the derivative equal to zero, solve for $u$, and then substitute this value of $u$ in (1). Hence we have

\[ \frac{dR_1}{du} = \frac{h^2 M}{2}\,(2u - 1) = 0. \]

\[ \therefore\ u = \tfrac12 \quad\text{and}\quad |R_1| = \frac{h^2 M}{2}\left|\frac14 - \frac12\right| = \frac{h^2 M}{8}. \]

The formula for the maximum error is therefore

\[ (2)\qquad E \le \frac{h^2 M}{8}. \]

Example. The function $1/N$ is tabulated in Barlow's Tables at unit intervals from 1 to 12,500. Find the possible error in the linear interpolation of this function when $N = 650$.

Solution. Here

\[ f(N) = \frac{1}{N}, \qquad f'(N) = -\frac{1}{N^2}, \qquad f''(N) = \frac{2}{N^3}. \]

Taking $h = 1$, $N = 650$, and substituting in (2), we find

\[ E \le \frac18\cdot\frac{2}{(650)^3} = \frac{1}{4 \times (650)^3} = \frac{1}{1{,}098{,}500{,}000}, \]

or

\[ E < 0.000000001. \]
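A short check of formula (2) for this example follows. The sketch evaluates the bound $h^2 M/8$ and compares it with the actual error of linear interpolation at the midpoint of the interval from 650 to 651; taking $M$ at the left end of the interval and testing only the midpoint are assumptions made for illustration.

```python
def f(n):
    return 1.0 / n

h, N = 1.0, 650.0
M = 2.0 / N**3                     # |f''| = 2/N^3 is largest at the left end of [N, N + h]
bound = h**2 * M / 8               # formula (2)

mid = N + h / 2
linear = (f(N) + f(N + h)) / 2     # linear interpolation at u = 1/2
actual = abs(linear - f(mid))      # true error at the midpoint
print(bound, actual)
```

The actual midpoint error comes out just under the bound, since $f''$ varies very little across a unit interval this far out in the table.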


Note. The student should ever bear in mind that linear interpolation is permissible only when first differences are constant, or practically so. He should therefore always compute a few first differences and see if they are constant before using linear interpolation.

EXERCISES VI

1. Estimate the error in the answers to Exercises 3 and 4 of Chapter II.

2. Compute the error in the answers to Exercises 2, 3, 4, and 6 of Chapter IV.



CHAPTER VII 


INTERPOLATION WITH TWO INDEPENDENT VARIABLES 
TRIGONOMETRIC INTERPOLATION 

43. Introduction. Occasionally it becomes necessary to interpolate a function of two arguments. For example, a table of elliptic integrals contains the two arguments θ and φ, on both of which the value of the integral depends.

The problem of double interpolation can be solved in two ways. The simplest method in theory is to interpolate first with respect to one variable and then with respect to the other. In making these interpolations any one of the standard interpolation formulas (Newton's, Stirling's, or Bessel's) may be used for either the first interpolations or the second. We always choose the most suitable formula for the problem at hand.

44. Double Interpolation by a Double Application of Single Interpolation. This method can be explained best by means of examples.

Example 1. The following table* gives the hour angle (t) of the sun corresponding to certain altitudes (a) and declinations (d) at a place in a certain latitude. Find the hour angle corresponding to d = 12°, a = 16°.

   d      a = 10°         14°            18°            22°
  20°   6h 11m 26s    5h 50m 17s    5h 29m 27s    5h  8m 48s
  15    5  55  41     5  35   5     5  14  39     4  54  17
  10    5  40  16     5  19  56     4  59  37     4  39  17
   5    5  24  50     5   4  30     4  44   4     4  23  29
   0    5   9   5     4  48  29     4  27  39     4   6  28


Solution. Here we take the entry 5h 35m 5s as the starting point. Then the initial values of d and a are $d_0 = 15°$, $a_0 = 14°$.

Let t = f(d, a) denote the functional relation connecting t, d, and a. We first find by ordinary interpolation the values of f(12°, 14°), f(12°, 18°), f(12°, 22°). To this end we construct the following difference tables corresponding to a = 14°, a = 18°, and a = 22°.

* A table of this kind is called the function table. The entries in this table are 
taken from Whittaker and Robinson’s Calculus of Observations, p. 374. 

(a)   a = 14°

   d     f(d, 14°)        Δf        Δ²f     Δ³f
  15°   5h 35m 05s
                       -15m 09s
  10    5  19  56                   -17s
                       -15  26              -18s
   5    5  04  30                   -35
                       -16  01
   0    4  48  29

(b)   a = 18°

   d     f(d, 18°)        Δf        Δ²f     Δ³f
  15°   5h 14m 39s
                       -15m 02s
  10    4  59  37                   -31s
                       -15  33              -21s
   5    4  44  04                   -52
                       -16  25
   0    4  27  39

(c)   a = 22°

   d     f(d, 22°)        Δf        Δ²f       Δ³f
  15°   4h 54m 17s
                       -15m 00s
  10    4  39  17                   -48s
                       -15  48                -25s
   5    4  23  29                   -1m 13s
                       -17  01
   0    4  06  28


Since the required value f(12°, 16°) of the function is near the beginning of the assigned values of d, we use Newton's formula (I) to find f(12°, a). Furthermore, since the given equidistant values of d decrease by steps of 5°, we have h = -5° and therefore

$$u = \frac{d - d_0}{h} = \frac{12 - 15}{-5} = 0.6.$$

Now substituting in (1) of Art. 20 this value of u and the other quantities from table (a) above, we have

$$f(12°, 14°) = 5^{h}35^{m}5^{s} + 0.6(-15^{m}9^{s}) + \frac{0.6(-0.4)}{2}(-17^{s}) + \frac{0.6(-0.4)(-1.4)}{6}(-18^{s}) = 5^{h}26^{m}1^{s}.$$

Using the values in table (b), we get

$$f(12°, 18°) = 5^{h}14^{m}39^{s} + 0.6(-15^{m}2^{s}) + \frac{0.6(-0.4)}{2}(-31^{s}) + \frac{0.6(-0.4)(-1.4)}{6}(-21^{s}) = 5^{h}5^{m}40^{s}.$$

In like manner, from table (c) we get

$$f(12°, 22°) = 4^{h}54^{m}17^{s} + 0.6(-15^{m}0^{s}) + \frac{0.6(-0.4)}{2}(-48^{s}) + \frac{0.6(-0.4)(-1.4)}{6}(-25^{s}) = 4^{h}45^{m}21^{s}.$$

The next step in the solution is to form a difference table of these functions just computed. Hence we have

   a     f(12°, a)        Δf        Δ²f
  14°   5h 26m  1s
                       -20m 21s
  18    5   5  40                   +2s
                       -20  19
  22    4  45  21

Now since the required value of the function is also near the beginning of the assigned values of a, we again use Newton's formula (I). Also, since the equidistant values of a increase by 4°, we have h = 4°. Hence

$$u = \frac{a - a_0}{h} = \frac{16° - 14°}{4°} = 0.5.$$

Substituting in (1) of Art. 20 this value of u and the other quantities from the tables above, we finally get

$$f(12°, 16°) = 5^{h}26^{m}1^{s} + 0.5(-20^{m}21^{s}) + \frac{0.5(-0.5)}{2}(2^{s}) = 5^{h}15^{m}50^{s}.$$
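
The double application of Newton's formula (I) in Example 1 can be sketched in a few lines of Python. This is an illustration only; the helper names `hms` and `newton_forward` are assumptions of the sketch, and the hour angles are held in seconds. Unlike the hand computation, the intermediate values f(12°, a) are not rounded to whole seconds, so the final figure may differ in the fraction of a second.

```python
def hms(h, m, s):
    """Convert hours, minutes, seconds to seconds."""
    return 3600 * h + 60 * m + s


def newton_forward(values, u):
    """Newton's forward-difference formula for equally spaced entries.

    values[0] is the starting entry and u = (x - x0) / h.
    """
    diffs = list(values)
    result = 0.0
    coeff = 1.0  # u(u-1)...(u-k+1)/k!
    for k in range(len(values)):
        result += coeff * diffs[0]
        # leading differences of the next order
        diffs = [diffs[i + 1] - diffs[i] for i in range(len(diffs) - 1)]
        coeff *= (u - k) / (k + 1)
    return result


# Hour angles for d = 15, 10, 5, 0 degrees, one list per altitude a.
table = {
    14: [hms(5, 35, 5), hms(5, 19, 56), hms(5, 4, 30), hms(4, 48, 29)],
    18: [hms(5, 14, 39), hms(4, 59, 37), hms(4, 44, 4), hms(4, 27, 39)],
    22: [hms(4, 54, 17), hms(4, 39, 17), hms(4, 23, 29), hms(4, 6, 28)],
}

u_d = (12 - 15) / (-5)  # 0.6, since d decreases by steps of 5 degrees
u_a = (16 - 14) / 4     # 0.5

# First interpolate in d for each altitude, then in a.
column = [newton_forward(table[a], u_d) for a in (14, 18, 22)]
t = newton_forward(column, u_a)

total = round(t)
print(total // 3600, (total % 3600) // 60, total % 60)
```

Run as written, the sketch reproduces the hour angle 5h 15m 50s obtained above.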


Note. If it should be required to compute f(14°, 20°), for example, we would set out from the entry 5h 55m 41s and compute f(14°, 10°), f(14°, 14°), f(14°, 18°), and f(14°, 22°) by Newton's formula (I). Then to find f(14°, 20°) we would use Newton's formula (II), because the required value is near the end of the given values of a.

Example 2. Find from a table of elliptic integrals the value of

$$\int_0^{\sin^{-1}(12/13)} \frac{d\phi}{\sqrt{1 - 0.78\sin^2\phi}}.$$

Solution. Comparing this integral with the standard elliptic integral of the first kind, namely

$$F(\theta, \phi) = \int_0^{\phi} \frac{d\phi}{\sqrt{1 - \sin^2\theta\,\sin^2\phi}},$$

we have

φ = sin⁻¹(12/13) = sin⁻¹(0.9230769) = 67° 22' 48".5 = 67°.38014,

sin² θ = 0.78,
sin θ = 0.8831761,
θ = 62° 01' 40".4 = 62°.02789.

In problems of this kind, where extensive tables are at hand, it is better to use central-difference formulas. Hence we write down the appropriate portion of the given function table, compute the necessary difference tables, and from them calculate the values of F(60°, 67°.38014), F(61°, 67°.38014), F(62°, 67°.38014), F(63°, 67°.38014), and F(64°, 67°.38014) by means of Bessel's formula (VI), because 67°.38014 is near the middle of an interval. Then we form a difference table from these computed functions and find F(62°.02789, 67°.38014) by means of Stirling's formula (III), because here the value 62°.02789 is near the beginning of an interval.

The function table is given below, and from it the difference tables 
following are computed. 



   φ      θ = 60°       61°          62°          63°          64°
  65°   1.3489264   1.3559464   1.3630180   1.3701309   1.3772732
  66    1.3772777   1.3847727   1.3923331   1.3999481   1.4076057
  67    1.4059999   1.4139971   1.4220753   1.4302236   1.4384298
  68    1.4350955   1.4436231   1.4522494   1.4609635   1.4697532
  69    1.4645659   1.4736530   1.4828589   1.4921728   1.5015826
  70    1.4944109   1.5040879   1.5139061   1.5238552   1.5339233



(a)   θ = 60°

   φ    F(60°, φ)      ΔF      Δ²F    Δ³F    Δ⁴F    Δ⁵F
  65°   1.3489264
                     283513
  66    1.3772777             3709
                     287222            25
  67    1.4059999             3734           -11
                     290956            14            -5
  68    1.4350955             3748           -16
                     294704            -2
  69    1.4645659             3746
                     298450
  70    1.4944109

(b)   θ = 61°

   φ    F(61°, φ)      ΔF      Δ²F    Δ³F    Δ⁴F    Δ⁵F
  65°   1.3559464
                     288263
  66    1.3847727             3981
                     292244            35
  67    1.4139971             4016           -12
                     296260            23            +1
  68    1.4436231             4039           -11
                     300299            11
  69    1.4736530             4050
                     304349
  70    1.5040879

(c)   θ = 62°

   φ    F(62°, φ)      ΔF      Δ²F    Δ³F    Δ⁴F    Δ⁵F
  65°   1.3630180
                     293151
  66    1.3923331             4271
                     297422            48
  67    1.4220753             4319           -13
                     301741            35            +1
  68    1.4522494             4354           -12
                     306095            23
  69    1.4828589             4377
                     310472
  70    1.5139061

(d)   θ = 63°

   φ    F(63°, φ)      ΔF      Δ²F    Δ³F    Δ⁴F    Δ⁵F
  65°   1.3701309
                     298172
  66    1.3999481             4583
                     302755            61
  67    1.4302236             4644           -11
                     307399            50            -2
  68    1.4609635             4694           -13
                     312093            37
  69    1.4921728             4731
                     316824
  70    1.5238552

(e)   θ = 64°

   φ    F(64°, φ)      ΔF      Δ²F    Δ³F    Δ⁴F    Δ⁵F
  65°   1.3772732
                     303325
  66    1.4076057             4916
                     308241            77
  67    1.4384298             4993           -10
                     313234            67            -4
  68    1.4697532             5060           -14
                     318294            53
  69    1.5015826             5113
                     323407
  70    1.5339233




Here

φ₀ = 67°, φ = 67°.38014, h = 1°,
u = 0.38014,
v = u - 1/2 = -0.11986.

Substituting in Bessel's formula (VI) the quantities given in table (a), we have

F(60°, 67°.38014) = 1.4205477 - 0.00348740 - 0.00004408 + 0.00000001 - 0.00000003
                  = 1.4170162.

In a similar manner we get from tables (b), (c), (d), (e),

F(61°, 67°.38014) = 1.4252117,
F(62°, 67°.38014) = 1.4334946,
F(63°, 67°.38014) = 1.4418540,
F(64°, 67°.38014) = 1.4502779.

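Forming now a table of differences from these computed functions, we have

   θ    F(θ, 67°.38014)     ΔF     Δ²F    Δ³F    Δ⁴F
  60°     1.4170162
                          81955
  61      1.4252117                874
                          82829          -109
  62      1.4334946                765            -11
                          83594          -120
  63      1.4418540                645
                          84239
  64      1.4502779

Here θ₀ = 62° and u = 0.02789. Substituting in Stirling's formula (III), we get

F(62°.02789, 67°.38014) = 1.4334946 + 0.00023208 + 0.00000003 + 0.00000005
                        = 1.4337268.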
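
The last step of Example 2 can be checked with a short Python sketch of Stirling's formula carried to fourth differences. The function name `stirling` and the list layout are assumptions of this sketch; the five input values are the ones computed above by Bessel's formula.

```python
def stirling(y, u):
    """Stirling's central-difference formula through fourth differences.

    y holds five equally spaced values y_-2, y_-1, y_0, y_1, y_2
    (stored as y[0] .. y[4]); u = (x - x_0) / h.
    """
    d1 = [y[i + 1] - y[i] for i in range(4)]
    d2 = [d1[i + 1] - d1[i] for i in range(3)]
    d3 = [d2[i + 1] - d2[i] for i in range(2)]
    d4 = d3[1] - d3[0]
    return (y[2]
            + u * (d1[1] + d1[2]) / 2            # mean of the first differences
            + u**2 / 2 * d2[1]                   # second difference at y_-1
            + u * (u**2 - 1) / 6 * (d3[0] + d3[1]) / 2
            + u**2 * (u**2 - 1) / 24 * d4)


# F(theta, 67.38014 deg) for theta = 60, 61, 62, 63, 64 degrees.
F = [1.4170162, 1.4252117, 1.4334946, 1.4418540, 1.4502779]
u = 62.02789 - 62.0  # theta_0 = 62 degrees, h = 1 degree

value = stirling(F, u)
print(f"{value:.7f}")
```

To seven decimals the sketch agrees with the value 1.4337268 found above.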

45. Double or Two-Way Differences. Before explaining the second method of dealing with the problem of double interpolation it is necessary to define double or two-way differences, to which we now turn our attention.

Let z = f(x, y) denote any function of two independent variables x and y, and let $z_{rs} = f(x_r, y_s)$. Let us next construct the following function table:



          x0     x1     x2     x3     x4    . . .    xm
  y0     z00    z10    z20    z30    z40    . . .    zm0
  y1     z01    z11    z21    z31    z41    . . .    zm1
  y2     z02    z12    z22    z32    z42    . . .    zm2
  y3     z03    z13    z23    z33    z43    . . .    zm3
  y4     z04    z14    z24    z34    z44    . . .    zm4
  . .    . .    . .    . .    . .    . .    . . .    . .
  yn     z0n    z1n    z2n    z3n    z4n    . . .    zmn


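We now define double or two-way differences as follows:

$$\begin{aligned}
\Delta^{1+0}z_{00} &= \Delta_x z_{00} = z_{10} - z_{00},\\
\Delta^{1+0}z_{01} &= \Delta_x z_{01} = z_{11} - z_{01},\\
\Delta^{1+0}z_{02} &= \Delta_x z_{02} = z_{12} - z_{02},\\
\Delta^{0+1}z_{00} &= \Delta_y z_{00} = z_{01} - z_{00},\\
\Delta^{0+1}z_{10} &= \Delta_y z_{10} = z_{11} - z_{10},\\
\Delta^{0+1}z_{20} &= \Delta_y z_{20} = z_{21} - z_{20}.
\end{aligned}$$

Or, more generally,

$$\Delta^{1+0}z_{rs} = \Delta_x z_{rs} = z_{r+1,s} - z_{rs}, \qquad \Delta^{0+1}z_{rs} = \Delta_y z_{rs} = z_{r,s+1} - z_{rs}.$$

Also,

$$\begin{aligned}
\Delta^{2+0}z_{00} &= \Delta_x^2 z_{00} = z_{20} - 2z_{10} + z_{00},\\
\Delta^{2+0}z_{01} &= \Delta_x^2 z_{01} = z_{21} - 2z_{11} + z_{01},\\
\Delta^{2+0}z_{02} &= \Delta_x^2 z_{02} = z_{22} - 2z_{12} + z_{02},\\
\Delta^{0+2}z_{00} &= \Delta_y^2 z_{00} = z_{02} - 2z_{01} + z_{00},\\
\Delta^{0+2}z_{10} &= \Delta_y^2 z_{10} = z_{12} - 2z_{11} + z_{10},\\
\Delta^{0+2}z_{20} &= \Delta_y^2 z_{20} = z_{22} - 2z_{21} + z_{20},\\
\Delta^{1+1}z_{00} &= \Delta^{1+0}z_{01} - \Delta^{1+0}z_{00}\\
&= \Delta^{0+1}z_{10} - \Delta^{0+1}z_{00},\\
\Delta^{3+0}z_{00} &= \Delta_x^3 z_{00} = z_{30} - 3z_{20} + 3z_{10} - z_{00},\\
\Delta^{3+0}z_{01} &= \Delta_x^3 z_{01} = z_{31} - 3z_{21} + 3z_{11} - z_{01},\\
\Delta^{0+3}z_{00} &= \Delta_y^3 z_{00} = z_{03} - 3z_{02} + 3z_{01} - z_{00},\\
\Delta^{0+3}z_{10} &= \Delta_y^3 z_{10} = z_{13} - 3z_{12} + 3z_{11} - z_{10},\\
\Delta^{2+1}z_{00} &= \Delta^{2+0}z_{01} - \Delta^{2+0}z_{00},\\
\Delta^{1+2}z_{00} &= \Delta^{0+2}z_{10} - \Delta^{0+2}z_{00},\\
\Delta^{4+0}z_{00} &= \Delta_x^4 z_{00} = z_{40} - 4z_{30} + 6z_{20} - 4z_{10} + z_{00},\\
\Delta^{0+4}z_{00} &= \Delta_y^4 z_{00} = z_{04} - 4z_{03} + 6z_{02} - 4z_{01} + z_{00},\\
\Delta^{2+2}z_{00} &= \Delta^{2+0}z_{02} - 2\Delta^{2+0}z_{01} + \Delta^{2+0}z_{00}\\
&= \Delta^{0+2}z_{20} - 2\Delta^{0+2}z_{10} + \Delta^{0+2}z_{00}.
\end{aligned}$$

The general formula for writing down these differences is easily seen to be

(1)
$$\begin{aligned}
\Delta^{m+n}z_{00} &= \Delta^{m+0}z_{0n} - n\,\Delta^{m+0}z_{0,n-1} + \frac{n(n-1)}{2}\,\Delta^{m+0}z_{0,n-2} - \cdots + (-1)^n\,\Delta^{m+0}z_{00}\\
&= \Delta^{0+n}z_{m0} - m\,\Delta^{0+n}z_{m-1,0} + \frac{m(m-1)}{2}\,\Delta^{0+n}z_{m-2,0} - \cdots + (-1)^m\,\Delta^{0+n}z_{00}.
\end{aligned}$$

The symbol $\Delta_x^m z_{00}$, for example, means that we find the mth difference of $z_{00}$ with respect to x, y being held constant.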
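
The definitions above amount to taking an mth difference in x and an nth difference in y, in either order. A minimal Python sketch, assuming the function table is stored as a list of lists `z[r][s]` (the storage layout and the name `two_way_difference` are assumptions of this sketch), is shown here with the hour-angle table of Art. 44 as test data.

```python
from math import comb


def two_way_difference(z, m, n, r=0, s=0):
    """Two-way difference of order m in x and n in y, starting at z[r][s].

    Expands both differences by binomial coefficients with alternating
    signs, which is equivalent to applying formula (1) of this article.
    """
    total = 0
    for i in range(m + 1):
        for j in range(n + 1):
            sign = (-1) ** ((m - i) + (n - j))
            total += sign * comb(m, i) * comb(n, j) * z[r + i][s + j]
    return total


def hms(h, m, s):
    return 3600 * h + 60 * m + s


# Rows: d = 15, 10, 5, 0 degrees (x direction); columns: a = 14, 18, 22 (y direction).
z = [
    [hms(5, 35, 5), hms(5, 14, 39), hms(4, 54, 17)],
    [hms(5, 19, 56), hms(4, 59, 37), hms(4, 39, 17)],
    [hms(5, 4, 30), hms(4, 44, 4), hms(4, 23, 29)],
    [hms(4, 48, 29), hms(4, 27, 39), hms(4, 6, 28)],
]

for m, n in [(1, 1), (2, 1), (1, 2), (3, 1), (2, 2)]:
    print((m, n), two_way_difference(z, m, n), "seconds")
```

The design choice here is to expand the differences directly from the tabular entries rather than to build intermediate difference tables; for hand computation the tables are more convenient, but the expansion is shorter to program.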

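46. A General Formula for Double Interpolation. We are now in a position to consider a general formula for double interpolation. The following formula is derived in O. Biermann's Mathematische Näherungsmethoden, pages 138-144:

(1)
$$\begin{aligned}
z = f(x, y) = z_{00} &+ \left[\frac{x - x_0}{h}\,\Delta^{1+0}z_{00} + \frac{y - y_0}{k}\,\Delta^{0+1}z_{00}\right]\\
&+ \frac{1}{2!}\left[\frac{(x - x_0)(x - x_1)}{h^2}\,\Delta^{2+0}z_{00} + \frac{2(x - x_0)(y - y_0)}{hk}\,\Delta^{1+1}z_{00} + \frac{(y - y_0)(y - y_1)}{k^2}\,\Delta^{0+2}z_{00}\right] + \cdots\\
&+ \frac{1}{m!}\left[\frac{(x - x_0)(x - x_1)\cdots(x - x_{m-1})}{h^m}\,\Delta^{m+0}z_{00}
+ \frac{m(x - x_0)(x - x_1)\cdots(x - x_{m-2})(y - y_0)}{h^{m-1}k}\,\Delta^{(m-1)+1}z_{00}\right.\\
&\qquad + \frac{m(m-1)}{2}\,\frac{(x - x_0)(x - x_1)\cdots(x - x_{m-3})(y - y_0)(y - y_1)}{h^{m-2}k^2}\,\Delta^{(m-2)+2}z_{00} + \cdots\\
&\qquad \left. + \frac{(y - y_0)(y - y_1)\cdots(y - y_{m-1})}{k^m}\,\Delta^{0+m}z_{00}\right] + R(x_0, y_0).
\end{aligned}$$

Here h and k are the intervals between the equidistant values of x and y, respectively, and $R(x_0, y_0)$ is the remainder term.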

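This formula can be simplified by changing the variables from x and y to u and v, as follows:

Put

$$u = \frac{x - x_0}{h}, \quad \text{or} \quad x = x_0 + hu.$$

Then

$$\frac{x - x_1}{h} = \frac{x - (x_0 + h)}{h} = \frac{x - x_0}{h} - 1 = u - 1,$$

$$\frac{x - x_2}{h} = \frac{x - (x_0 + 2h)}{h} = \frac{x - x_0}{h} - \frac{2h}{h} = u - 2,$$

and

$$\frac{x - x_{m-1}}{h} = u - (m - 1).$$

Also, put

$$v = \frac{y - y_0}{k}, \quad \text{or} \quad y = y_0 + kv.$$

Then

$$\frac{y - y_1}{k} = \frac{y - (y_0 + k)}{k} = \frac{y - y_0}{k} - \frac{k}{k} = v - 1, \qquad \frac{y - y_2}{k} = v - 2, \quad \text{etc.}$$
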

Substituting these values of $(x - x_0)/h$, $(y - y_0)/k$, etc. in (1), we get

(X)
$$\begin{aligned}
z = f(x, y) &= f(x_0 + hu,\; y_0 + kv) = z_{00} + u\,\Delta^{1+0}z_{00} + v\,\Delta^{0+1}z_{00}\\
&+ \frac{1}{2!}\left[u(u-1)\Delta^{2+0}z_{00} + 2uv\,\Delta^{1+1}z_{00} + v(v-1)\Delta^{0+2}z_{00}\right]\\
&+ \frac{1}{3!}\left[u(u-1)(u-2)\Delta^{3+0}z_{00} + 3u(u-1)v\,\Delta^{2+1}z_{00} + 3uv(v-1)\Delta^{1+2}z_{00} + v(v-1)(v-2)\Delta^{0+3}z_{00}\right]\\
&+ \frac{1}{4!}\left[u(u-1)(u-2)(u-3)\Delta^{4+0}z_{00} + 4u(u-1)(u-2)v\,\Delta^{3+1}z_{00} + 6u(u-1)v(v-1)\Delta^{2+2}z_{00}\right.\\
&\qquad \left. + 4uv(v-1)(v-2)\Delta^{1+3}z_{00} + v(v-1)(v-2)(v-3)\Delta^{0+4}z_{00}\right] + R_n(x_0, y_0),
\end{aligned}$$

where

$$\begin{aligned}
R_n(x_0, y_0) = \frac{1}{(n+1)!}\Big[&u(u-1)(u-2)\cdots(u-n)\,\Delta^{(n+1)+0}z_{00} + (n+1)\,u(u-1)(u-2)\cdots[u-(n-1)]\,v\,\Delta^{n+1}z_{00}\\
&+ \frac{(n+1)n}{2}\,u(u-1)\cdots[u-(n-2)]\,v(v-1)\,\Delta^{(n-1)+2}z_{00} + \cdots + v(v-1)(v-2)\cdots(v-n)\,\Delta^{0+(n+1)}z_{00}\Big].
\end{aligned}$$

This formula (X) corresponds to Newton's formula (I) and reduces to that formula if we put either u = 0 or v = 0.

In some applications of mathematics, particularly in Navigation, linear 
interpolation with several arguments is of considerable importance. For 
example, in various navigation tables are tabulated the complete solutions 
of thousands of astronomical triangles. Here the one or two desired parts 
are functions of three arguments. 

Formulas for linear interpolation with several arguments are readily 
found from the general formula (1) and from extensions of that formula. 
Thus for two arguments, after neglecting all differences higher than the 
first, we have 

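(2)
$$z = z_{00} + \frac{x - x_0}{h}\,(\Delta_x z_{00}) + \frac{y - y_0}{k}\,(\Delta_y z_{00}).$$

For a function of three arguments, as u = f(x, y, z), we have

(3)
$$u = u_{000} + \frac{x - x_0}{h}\,(\Delta_x u_{000}) + \frac{y - y_0}{k}\,(\Delta_y u_{000}) + \frac{z - z_0}{l}\,(\Delta_z u_{000}),$$

where l is the interval between the equidistant values of z; and so on for any number of arguments.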



We shall now apply formula (X) to the two examples which have already been worked by the first method.

Example 3. Solve Example 1 of Art. 44 by means of formula (X).

Solution. For the sake of clearness we repeat the function table given in Example 1, and work the problem anew from the start.

   d      a = 14°         18°            22°
  15°   5h 35m  5s    5h 14m 39s    4h 54m 17s
  10    5  19  56     4  59  37     4  39  17
   5    5   4  30     4  44   4     4  23  29
   0    4  48  29     4  27  39     4   6  28



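Forming next the necessary difference tables, we have

a₀ = 14°

   d      f(r,0)       Δ(1+0)f    Δ(2+0)f   Δ(3+0)f
  15°   5h 35m 05s
                      -15m 09s
  10    5  19  56                  -17s
                      -15  26               -18s
   5    5  04  30                  -35
                      -16  01
   0    4  48  29

a₁ = 18°

   d      f(r,1)       Δ(1+0)f    Δ(2+0)f   Δ(3+0)f
  15°   5h 14m 39s
                      -15m 02s
  10    4  59  37                  -31s
                      -15  33               -21s
   5    4  44  04                  -52
                      -16  25
   0    4  27  39

a₂ = 22°

   d      f(r,2)       Δ(1+0)f    Δ(2+0)f    Δ(3+0)f
  15°   4h 54m 17s
                      -15m 00s
  10    4  39  17                  -48s
                      -15  48                -25s
   5    4  23  29                  -1m 13s
                      -17  01
   0    4  06  28

These three tables, it will be observed, are the same as tables (a), (b), (c) in Example 1.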

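We next form difference tables by taking constant values of d.

d₀ = 15°

   a      f(0,s)       Δ(0+1)f    Δ(0+2)f
  14°   5h 35m 05s
                      -20m 26s
  18    5  14  39                  +4s
                      -20  22
  22    4  54  17

d₁ = 10°

   a      f(1,s)       Δ(0+1)f    Δ(0+2)f
  14°   5h 19m 56s
                      -20m 19s
  18    4  59  37                  -1s
                      -20  20
  22    4  39  17

d₂ = 5°

   a      f(2,s)       Δ(0+1)f    Δ(0+2)f
  14°   5h 04m 30s
                      -20m 26s
  18    4  44  04                  -9s
                      -20  35
  22    4  23  29

Hence

$$\begin{aligned}
\Delta^{1+1}f_{00} &= \Delta^{1+0}f_{01} - \Delta^{1+0}f_{00} = -15^{m}2^{s} - (-15^{m}9^{s}) = 7^{s},\\
\Delta^{1+2}f_{00} &= \Delta^{0+2}f_{10} - \Delta^{0+2}f_{00} = -1^{s} - (4^{s}) = -5^{s},\\
\Delta^{2+1}f_{00} &= \Delta^{2+0}f_{01} - \Delta^{2+0}f_{00} = -31^{s} - (-17^{s}) = -14^{s},\\
\Delta^{1+3}f_{00} &= \Delta^{0+3}f_{10} - \Delta^{0+3}f_{00} = 0 - 0 = 0,\\
\Delta^{3+1}f_{00} &= \Delta^{3+0}f_{01} - \Delta^{3+0}f_{00} = -21^{s} - (-18^{s}) = -3^{s},\\
\Delta^{2+2}f_{00} &= \Delta^{2+0}f_{02} - 2\Delta^{2+0}f_{01} + \Delta^{2+0}f_{00} = -48^{s} - 2(-31^{s}) + (-17^{s}) = -3^{s},\\
\Delta^{0+4}f_{00} &= 0.
\end{aligned}$$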


We have already found in Example 1 that

u = 0.6, v = 0.5.

Substituting in (X) these values of u, v, and the computed differences, we get

$$\begin{aligned}
f(12°, 16°) = 5^{h}35^{m}5^{s} &+ 0.6(-15^{m}9^{s}) + 0.5(-20^{m}26^{s})\\
&+ \tfrac{1}{2}\left[0.6(-0.4)(-17^{s}) + 0.6(7^{s}) + 0.5(-0.5)(4^{s})\right]\\
&+ \tfrac{1}{6}\left[0.6(-0.4)(-1.4)(-18^{s}) + 0.9(-0.4)(-14^{s}) + 0.9(-0.5)(-5^{s}) + 0\right]\\
&+ \tfrac{1}{24}\left[0 + 1.2(-0.4)(-1.4)(-3^{s}) + 1.8(-0.4)(-0.5)(-3^{s}) + 0 + 0\right],
\end{aligned}$$

or f(12°, 16°) = 5h 15m 50s, as previously found.
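
Formula (X) lends itself to a compact program, because the coefficient of each two-way difference is a product of two binomial coefficients, one in u and one in v. The Python sketch below is an illustration under stated assumptions: the names `binom_real`, `two_way_difference`, and `double_newton` are hypothetical, and, as in the worked example, any difference that would need entries outside the table is taken as zero.

```python
from math import comb


def binom_real(u, k):
    """u(u-1)...(u-k+1)/k! for real u."""
    c = 1.0
    for i in range(k):
        c *= (u - i) / (i + 1)
    return c


def two_way_difference(z, m, n):
    """Two-way difference of order m in x and n in y at z[0][0]."""
    return sum((-1) ** ((m - i) + (n - j)) * comb(m, i) * comb(n, j) * z[i][j]
               for i in range(m + 1) for j in range(n + 1))


def double_newton(z, u, v, order):
    """Formula (X) truncated after differences of total order `order`."""
    rows, cols = len(z), len(z[0])
    total = 0.0
    for m in range(order + 1):
        for n in range(order + 1 - m):
            if m < rows and n < cols:  # difference is available in the table
                total += (binom_real(u, m) * binom_real(v, n)
                          * two_way_difference(z, m, n))
    return total


def hms(h, m, s):
    return 3600 * h + 60 * m + s


# Rows: d = 15, 10, 5, 0 degrees; columns: a = 14, 18, 22 degrees.
z = [
    [hms(5, 35, 5), hms(5, 14, 39), hms(4, 54, 17)],
    [hms(5, 19, 56), hms(4, 59, 37), hms(4, 39, 17)],
    [hms(5, 4, 30), hms(4, 44, 4), hms(4, 23, 29)],
    [hms(4, 48, 29), hms(4, 27, 39), hms(4, 6, 28)],
]

t = double_newton(z, u=0.6, v=0.5, order=4)
total = round(t)
print(total // 3600, (total % 3600) // 60, total % 60)
```

The product of binomial coefficients used here equals the coefficient written in (X); for instance, for the difference of order 2 + 1 it is u(u-1)/2! times v, which is the same as 3u(u-1)v/3!.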

Example 4. Solve Example 2 by means of formula (X).

Solution. Since (X) is not a central-difference formula, we do not use the same function table as in Example 2. From the definition of the two-way differences $\Delta^{m+n}z_{00}$ it will be seen that the following triangular function table, starting from F(62°, 67°), is all that is required for finding all differences up to the fourth order inclusive.

   φ      θ = 62°       63°          64°          65°          66°
  67°   1.4220753   1.4302236   1.4384298   1.4466803   1.4549598
  68    1.4522494   1.4609635   1.4697532   1.4786046
  69    1.4828589   1.4921728   1.5015826
  70    1.5139061   1.5238552
  71    1.5453920

















The following difference tables are next computed:

θ₀ = 62°

   φ     F(0,s)      Δ(0+1)F   Δ(0+2)F   Δ(0+3)F   Δ(0+4)F
  67°   1.4220753
                     301741
  68    1.4522494              4354
                     306095               23
  69    1.4828589              4377                -13
                     310472               10
  70    1.5139061              4387
                     314859
  71    1.5453920

θ₁ = 63°

   φ     F(1,s)      Δ(0+1)F   Δ(0+2)F   Δ(0+3)F
  67°   1.4302236
                     307399
  68    1.4609635              4694
                     312093               37
  69    1.4921728              4731
                     316824
  70    1.5238552

θ₂ = 64°

   φ     F(2,s)      Δ(0+1)F   Δ(0+2)F
  67°   1.4384298
                     313234
  68    1.4697532              5060
                     318294
  69    1.5015826

φ₀ = 67°

   θ     F(r,0)      Δ(1+0)F   Δ(2+0)F   Δ(3+0)F   Δ(4+0)F
  62°   1.4220753
                      81483
  63    1.4302236               579
                      82062              -146
  64    1.4384298               433                  3
                      82505              -143
  65    1.4466803               290
                      82795
  66    1.4549598

φ₁ = 68°

   θ     F(r,1)      Δ(1+0)F   Δ(2+0)F   Δ(3+0)F
  62°   1.4522494
                      87141
  63    1.4609635               756
                      87897              -139
  64    1.4697532               617
                      88514
  65    1.4786046

φ₂ = 69°

   θ     F(r,2)      Δ(1+0)F   Δ(2+0)F
  62°   1.4828589
                      93139
  63    1.4921728               959
                      94098
  64    1.5015826



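Hence

$$\begin{aligned}
\Delta^{1+1}F_{00} &= \Delta^{1+0}F_{01} - \Delta^{1+0}F_{00} = 87141 - 81483 = 5658,\\
\Delta^{1+2}F_{00} &= \Delta^{0+2}F_{10} - \Delta^{0+2}F_{00} = 4694 - 4354 = 340,\\
\Delta^{2+1}F_{00} &= \Delta^{2+0}F_{01} - \Delta^{2+0}F_{00} = 756 - 579 = 177,\\
\Delta^{1+3}F_{00} &= \Delta^{0+3}F_{10} - \Delta^{0+3}F_{00} = 37 - 23 = 14,\\
\Delta^{3+1}F_{00} &= \Delta^{3+0}F_{01} - \Delta^{3+0}F_{00} = -139 - (-146) = 7,\\
\Delta^{2+2}F_{00} &= \Delta^{2+0}F_{02} - 2\Delta^{2+0}F_{01} + \Delta^{2+0}F_{00} = 959 - 1512 + 579 = 26.
\end{aligned}$$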































In Example 2 we found u = 0.02789, v = 0.38014. Substituting in (X) these values of u, v, and the computed differences (expressed in units of the seventh decimal place), we get

$$\begin{aligned}
F(62°.02789,\ 67°.38014) = 1.4220753 &+ 0.02789(81483) + 0.38014(301741)\\
&+ \tfrac{1}{2}\big[0.02789(-0.97211)(579) + 2(0.02789)(0.38014)(5658)\\
&\qquad + 0.38014(-0.61986)(4354)\big]\\
&+ \tfrac{1}{6}\big[(0.02789)(-0.97211)(-1.97211)(-146)\\
&\qquad + 3(0.02789)(-0.97211)(0.38014)(177)\\
&\qquad + 3(0.02789)(0.38014)(-0.61986)(340)\\
&\qquad + 0.38014(-0.61986)(-1.61986)(23)\big]\\
&+ \tfrac{1}{24}\big[0.02789(-0.97211)(-1.97211)(-2.97211)(3)\\
&\qquad + 4(0.02789)(-0.97211)(-1.97211)(0.38014)(7)\\
&\qquad + 6(0.02789)(-0.97211)(0.38014)(-0.61986)(26)\\
&\qquad + 4(0.02789)(0.38014)(-0.61986)(-1.61986)(14)\\
&\qquad + 0.38014(-0.61986)(-1.61986)(-2.61986)(-13)\big]\\
= 1.4337264.&
\end{aligned}$$

This value differs from that found in Example 2 by four units in the last decimal place; but in view of the fact that different parts of the function table, different formulas, and different methods were used in the two computations the agreement is as close as could be expected.

Note. The two methods explained in this chapter are sufficient for the 
solution of all ordinary problems of double interpolation. As to which 
of these methods is preferable, it may be said that the use of formula (X) 
is probably shorter if all differences above the second are negligible. 

For a more extensive treatment of double interpolation the reader should consult Steffensen's Interpolation, pp. 203-223, and Tracts for Computers No. III, Part II, by Karl Pearson.


47. Trigonometric Interpolation. When the function we desire to represent by an interpolation formula is known to be periodic, it is better to use trigonometric interpolation. Hermite's formula for interpolating periodic functions is

(XI)
$$\begin{aligned}
y = {} & \frac{\sin(x - x_1)\sin(x - x_2)\cdots\sin(x - x_n)}{\sin(x_0 - x_1)\sin(x_0 - x_2)\cdots\sin(x_0 - x_n)}\,y_0\\
& + \frac{\sin(x - x_0)\sin(x - x_2)\cdots\sin(x - x_n)}{\sin(x_1 - x_0)\sin(x_1 - x_2)\cdots\sin(x_1 - x_n)}\,y_1 + \cdots\\
& + \frac{\sin(x - x_0)\sin(x - x_1)\cdots\sin(x - x_{n-1})}{\sin(x_n - x_0)\sin(x_n - x_1)\cdots\sin(x_n - x_{n-1})}\,y_n.
\end{aligned}$$

This function has the period 2π, as may be seen by replacing x by x + 2π. It is evident also that $y = y_0$ when $x = x_0$, $y = y_1$ when $x = x_1$, etc.

This formula of Hermite's for periodic functions corresponds to Lagrange's formula for non-periodic functions (Art. 27), and applies whether the given values of x are equidistant or not. By interchanging x and y in Hermite's formula we get a formula for the inverse interpolation of periodic functions, corresponding to (27.3) of Art. 27.
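
Hermite's formula (XI) has the same structure as Lagrange's formula with each factor replaced by a sine, so it can be sketched in Python in a few lines. The function name `hermite_trig_interp` and the sample nodes and values below are hypothetical illustrations, not data from the text.

```python
import math


def hermite_trig_interp(xs, ys, x):
    """Evaluate Hermite's trigonometric interpolation formula (XI) at x.

    xs: the given arguments (radians, not necessarily equidistant)
    ys: the corresponding functional values
    """
    total = 0.0
    for k, (xk, yk) in enumerate(zip(xs, ys)):
        term = yk
        for j, xj in enumerate(xs):
            if j != k:
                term *= math.sin(x - xj) / math.sin(xk - xj)
        total += term
    return total


# Hypothetical test data. With three nodes each term of (XI) is a product
# of two sines, i.e. a combination of 1, cos 2x and sin 2x, so a function
# of that form should be reproduced by the formula.
def f(t):
    return 2.0 + math.cos(2.0 * t) - 0.5 * math.sin(2.0 * t)


xs = [0.2, 0.7, 1.3]
ys = [f(t) for t in xs]

y_interp = hermite_trig_interp(xs, ys, 0.6)
print(y_interp, f(0.6))
```

The sketch also makes the two properties stated above easy to try out: the formula returns the given values at the given arguments, and its value is unchanged when x is increased by 2π.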

Example. Given the following corresponding values of x and y, find the value of y corresponding to x = 0.6, the values of x being in radians:

... the log sines being given directly in the Smithsonian Mathematical Tables, Hyperbolic Functions, Table III.

Note. The problem of trigonometric interpolation was first solved by Gauss,* who derived several formulas similar to Hermite's. The formula usually called Gauss's formula differs from Hermite's only in having the factor ½ written in front of all the angles; thus, sin ½(x - x₀), etc. It is believed, however, that Hermite's formula is simpler than any of the Gauss formulas.

EXERCISES VII

1. Using the data of Example 1, Art. 44, find by two methods the hour angle of the sun when a = 12° and d = 16°.


* Werke, Band III, pp. 260-327.



CHAPTER VIII 


NUMERICAL DIFFERENTIATION AND INTEGRATION 


I. NUMERICAL DIFFERENTIATION


48. Numerical Differentiation is the process of calculating the deriva- 
tives of a function by means of a set of given values of that function. The 
problem is solved by representing the function by an interpolation formula 
and then differentiating this formula as many times as desired. 

If the function is given by a table of values for equidistant values of the 
independent variable, it should be represented by an interpolation formula 
employing differences, such as Newton’s, Stirling’s, or Bessel’s. But if the 
given values of the function are not for equidistant values of the inde- 
pendent variable, we must represent the function by Lagrange’s or Hermite’s 
formulas. 

The considerations governing the choice of a formula employing dif- 
ferences are the same as in the case of interpolation. That is, if we desire 
the derivative at a point near the beginning of a set of tabular values, 
we use Newton’s formula (I). Whereas, if we desire the derivative at a 
point near the end of the table, we use Newton’s formula (II). For points 
near the middle of the table we should use a central-difference formula — 
Stirling’s or Bessel’s. 

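The values of derivatives in terms of differences may also be found by means of those interpolation formulas which employ differences. Thus, from Stirling's formula we have, since

$$u = \frac{x - x_0}{h}, \qquad \frac{dy}{dx} = \frac{dy}{du}\,\frac{du}{dx} = \frac{1}{h}\,\frac{dy}{du},$$

$$\begin{aligned}
y = y_0 &+ u\,\frac{\Delta y_{-1} + \Delta y_0}{2} + \frac{u^2}{2!}\,\Delta^2 y_{-1} + \frac{u(u^2 - 1)}{3!}\,\frac{\Delta^3 y_{-2} + \Delta^3 y_{-1}}{2} + \frac{u^2(u^2 - 1)}{4!}\,\Delta^4 y_{-2}\\
&+ \frac{u(u^2 - 1)(u^2 - 2^2)}{5!}\,\frac{\Delta^5 y_{-3} + \Delta^5 y_{-2}}{2} + \frac{u^2(u^2 - 1)(u^2 - 2^2)}{6!}\,\Delta^6 y_{-3} + \cdots,
\end{aligned}$$

$$\begin{aligned}
\frac{dy}{dx} = \frac{1}{h}\Big[&\frac{\Delta y_{-1} + \Delta y_0}{2} + u\,\Delta^2 y_{-1} + \frac{3u^2 - 1}{3!}\,\frac{\Delta^3 y_{-2} + \Delta^3 y_{-1}}{2} + \frac{4u^3 - 2u}{4!}\,\Delta^4 y_{-2}\\
&+ \frac{5u^4 - 15u^2 + 4}{5!}\,\frac{\Delta^5 y_{-3} + \Delta^5 y_{-2}}{2} + \frac{6u^5 - 20u^3 + 8u}{6!}\,\Delta^6 y_{-3} + \cdots\Big],
\end{aligned}$$

$$\begin{aligned}
\frac{d^2y}{dx^2} = \frac{1}{h^2}\Big[&\Delta^2 y_{-1} + u\,\frac{\Delta^3 y_{-2} + \Delta^3 y_{-1}}{2} + \frac{12u^2 - 2}{4!}\,\Delta^4 y_{-2}\\
&+ \frac{20u^3 - 30u}{5!}\,\frac{\Delta^5 y_{-3} + \Delta^5 y_{-2}}{2} + \frac{30u^4 - 60u^2 + 8}{6!}\,\Delta^6 y_{-3} + \cdots\Big],
\end{aligned}$$

$$\frac{d^3y}{dx^3} = \frac{1}{h^3}\Big[\frac{\Delta^3 y_{-2} + \Delta^3 y_{-1}}{2} + u\,\Delta^4 y_{-2} + \frac{60u^2 - 30}{5!}\,\frac{\Delta^5 y_{-3} + \Delta^5 y_{-2}}{2} + \frac{120u^3 - 120u}{6!}\,\Delta^6 y_{-3} + \cdots\Big],$$

$$\frac{d^4y}{dx^4} = \frac{1}{h^4}\Big[\Delta^4 y_{-2} + u\,\frac{\Delta^5 y_{-3} + \Delta^5 y_{-2}}{2} + \frac{360u^2 - 120}{6!}\,\Delta^6 y_{-3} + \cdots\Big],$$

$$\frac{d^5y}{dx^5} = \frac{1}{h^5}\Big[\frac{\Delta^5 y_{-3} + \Delta^5 y_{-2}}{2} + u\,\Delta^6 y_{-3} + \cdots\Big],$$

$$\frac{d^6y}{dx^6} = \frac{1}{h^6}\Big[\Delta^6 y_{-3} + \cdots\Big].$$

For the point $x = x_0$ we have u = 0. Hence on substituting this value of u in the formulas above, we get

$$\left(\frac{dy}{dx}\right)_{x_0} = \frac{1}{h}\Big[\frac{\Delta y_{-1} + \Delta y_0}{2} - \frac{1}{3!}\,\frac{\Delta^3 y_{-2} + \Delta^3 y_{-1}}{2} + \frac{4}{5!}\,\frac{\Delta^5 y_{-3} + \Delta^5 y_{-2}}{2} + \cdots\Big],$$

$$\left(\frac{d^2y}{dx^2}\right)_{x_0} = \frac{1}{h^2}\Big[\Delta^2 y_{-1} - \frac{2}{4!}\,\Delta^4 y_{-2} + \frac{8}{6!}\,\Delta^6 y_{-3} + \cdots\Big],$$

$$\left(\frac{d^3y}{dx^3}\right)_{x_0} = \frac{1}{h^3}\Big[\frac{\Delta^3 y_{-2} + \Delta^3 y_{-1}}{2} - \frac{30}{5!}\,\frac{\Delta^5 y_{-3} + \Delta^5 y_{-2}}{2} + \cdots\Big],$$

$$\left(\frac{d^4y}{dx^4}\right)_{x_0} = \frac{1}{h^4}\Big[\Delta^4 y_{-2} - \frac{120}{6!}\,\Delta^6 y_{-3} + \cdots\Big],$$

$$\left(\frac{d^5y}{dx^5}\right)_{x_0} = \frac{1}{h^5}\Big[\frac{\Delta^5 y_{-3} + \Delta^5 y_{-2}}{2} + \cdots\Big],$$

$$\left(\frac{d^6y}{dx^6}\right)_{x_0} = \frac{1}{h^6}\Big[\Delta^6 y_{-3} + \cdots\Big].$$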


Evidently we can find the derivatives in exactly the same way by dif- 
ferentiating Newton’s, Bessel’s, and Lagrange’s formulas. 

To find the maximum or minimum value of a tabulated function we compute the necessary differences from the given table, substitute them in the appropriate interpolation formula, put the first derivative of this formula equal to zero, and solve for u. Then x is found from the relation $x = x_0 + hu$.

We can also find the maximum or minimum value of a function by 
equating to zero the first derivative of Lagrange’s formula. 



Example. Find the first and second derivatives of the function tabulated below at the point x = 0.6.

    x        y          Δy        Δ²y       Δ³y     Δ⁴y
   0.4   1.5836494
                      2137932
   0.5   1.7974426              330018
                      2467950              34710
   0.6   2.0442376              364728              3648
                      2832678              38358
   0.7   2.3275054              403086
                      3235764
   0.8   2.6510818

Solution. Here $x_0 = 0.6$, u = 0, h = 0.1. Substituting in the formulas for the first and second derivatives at $x = x_0$ the appropriate differences from the table above, we get

$$\frac{dy}{dx} = 10\,[0.2650314 - 0.0006089] = 2.644225,$$

$$\frac{d^2y}{dx^2} = 100\,[0.0364728 - 0.0000304] = 3.64424.$$

The function tabulated above is

$$y = 2e^x - x - 1.$$

Hence

$$\frac{dy}{dx} = 2e^x - 1, \qquad \frac{d^2y}{dx^2} = 2e^x.$$

Putting x = 0.6 in these, we get

$$\frac{dy}{dx} = 2.644238, \qquad \frac{d^2y}{dx^2} = 3.644238$$

as the correct values for the first and second derivatives. The values found by numerical differentiation are therefore correct to five significant figures in the case of the first derivative and to six significant figures in the case of the second derivative.
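
The computation of this example can be repeated with a short Python sketch of the two formulas for the derivatives at $x = x_0$, carried through fourth differences. The function name `stirling_derivatives` is an assumption of the sketch; the five tabular values are those of the table above.

```python
import math


def stirling_derivatives(y, h):
    """First and second derivatives at the middle entry of five values.

    y holds y_-2, y_-1, y_0, y_1, y_2 (stored as y[0] .. y[4]) at spacing h.
    Uses the formulas for u = 0 derived from Stirling's formula,
    keeping differences up to the fourth order.
    """
    d1 = [y[i + 1] - y[i] for i in range(4)]
    d2 = [d1[i + 1] - d1[i] for i in range(3)]
    d3 = [d2[i + 1] - d2[i] for i in range(2)]
    d4 = d3[1] - d3[0]
    # (1/h) [ mean first difference - (1/3!) * mean third difference ]
    first = ((d1[1] + d1[2]) / 2 - (d3[0] + d3[1]) / 12) / h
    # (1/h^2) [ second difference - (2/4!) * fourth difference ]
    second = (d2[1] - d4 / 12) / h**2
    return first, second


y = [1.5836494, 1.7974426, 2.0442376, 2.3275054, 2.6510818]
first, second = stirling_derivatives(y, 0.1)

print(f"dy/dx   = {first:.6f}   exact {2 * math.exp(0.6) - 1:.6f}")
print(f"d2y/dx2 = {second:.6f}   exact {2 * math.exp(0.6):.6f}")
```

Because the tabular values are given to seven decimals only, the division by h and by h² magnifies the rounding errors; this is why the computed derivatives agree with the exact ones to fewer figures than the table itself carries.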

Partial derivatives of a tabulated function of two independent variables 
can be found by differentiating partially formula (X) of Art. 46. 















II. NUMERICAL INTEGRATION


49. Introduction. Numerical integration is the process of computing the value of a definite integral from a set of numerical values of the integrand. When applied to the integration of a function of a single variable, the process is sometimes called mechanical quadrature; when applied to the computation of a double integral of a function of two independent variables it is called mechanical cubature.

The problem of numerical integration, like that of numerical differentiation, is solved by representing the integrand by an interpolation formula and then integrating this formula between the desired limits. Thus, to find the value of the definite integral $\int_a^b y\,dx$, we replace the function y by an interpolation formula, usually one involving differences, and then integrate this formula between the limits a and b. In this way we can derive quadrature formulas for the approximate integration of any function for which numerical values are known. We shall now derive some of the simplest and most useful of the quadrature formulas.

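50. A General Quadrature Formula for Equidistant Ordinates. In Newton's, Stirling's, and Bessel's interpolation formulas the relation connecting x and u is

(1)   $x = x_0 + hu$,

from which we get

(2)   $dx = h\,du$.

Let us now integrate Newton's formula (I) over n equidistant intervals of width h (= Δx). The limits of integration for x are $x_0$ and $x_0 + nh$. Hence from (1) the corresponding limits for u are 0 and n. We therefore have

$$\begin{aligned}
\int_{x_0}^{x_0 + nh} y\,dx = h\int_0^n \Big(&y_0 + u\,\Delta y_0 + \frac{u(u-1)}{2!}\,\Delta^2 y_0 + \frac{u(u-1)(u-2)}{3!}\,\Delta^3 y_0\\
&+ \frac{u(u-1)(u-2)(u-3)}{4!}\,\Delta^4 y_0 + \frac{u(u-1)(u-2)(u-3)(u-4)}{5!}\,\Delta^5 y_0\\
&+ \frac{u(u-1)(u-2)(u-3)(u-4)(u-5)}{6!}\,\Delta^6 y_0\Big)\,du,
\end{aligned}$$

or

(50.1)
$$\begin{aligned}
\int_{x_0}^{x_0 + nh} y\,dx = h\Big[&n y_0 + \frac{n^2}{2}\,\Delta y_0 + \Big(\frac{n^3}{3} - \frac{n^2}{2}\Big)\frac{\Delta^2 y_0}{2!} + \Big(\frac{n^4}{4} - n^3 + n^2\Big)\frac{\Delta^3 y_0}{3!}\\
&+ \Big(\frac{n^5}{5} - \frac{3n^4}{2} + \frac{11n^3}{3} - 3n^2\Big)\frac{\Delta^4 y_0}{4!}\\
&+ \Big(\frac{n^6}{6} - 2n^5 + \frac{35n^4}{4} - \frac{50n^3}{3} + 12n^2\Big)\frac{\Delta^5 y_0}{5!}\\
&+ \Big(\frac{n^7}{7} - \frac{15n^6}{6} + 17n^5 - \frac{225n^4}{4} + \frac{274n^3}{3} - 60n^2\Big)\frac{\Delta^6 y_0}{6!}\Big].
\end{aligned}$$
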


From this general formula (50.1), we can obtain a variety of quadrature formulas by putting n = 1, 2, 3, etc. The best two are found by putting n = 2 and n = 6.
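
The numerical coefficients that (50.1) yields for a given n can be generated exactly with rational arithmetic. The following Python sketch is an illustration only (the function name `quadrature_coefficients` is an assumption): it integrates each product u(u-1)...(u-k+1) from 0 to n and divides by k!, which is the coefficient of the kth difference of $y_0$.

```python
from fractions import Fraction
from math import factorial


def quadrature_coefficients(n, max_order):
    """Coefficients c_k such that, by (50.1),

        integral over n intervals = h * sum_k c_k * (k-th difference of y_0),

    with c_k = (1/k!) * integral from 0 to n of u(u-1)...(u-k+1) du.
    """
    coeffs = []
    poly = [Fraction(1)]  # coefficients of u(u-1)...(u-k+1), lowest power first
    for k in range(max_order + 1):
        integral = sum(c * Fraction(n) ** (p + 1) / (p + 1)
                       for p, c in enumerate(poly))
        coeffs.append(integral / factorial(k))
        # multiply the polynomial by (u - k) for the next order
        shifted = [Fraction(0)] + poly                    # u * poly
        scaled = [-k * c for c in poly] + [Fraction(0)]   # -k * poly
        poly = [a + b for a, b in zip(shifted, scaled)]
    return coeffs


print([str(c) for c in quadrature_coefficients(2, 2)])
print([str(c) for c in quadrature_coefficients(6, 6)])
```

For n = 2 the coefficients are 2, 2, 1/3, and for n = 6 they are 6, 18, 27, 24, 123/10, 33/10, 41/140; these are the numbers that enter the two rules derived from (50.1) in the next two articles.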


51. Simpson's Rule. Putting n = 2 in (50.1) and neglecting all differences above the second,* we get

$$\begin{aligned}
\int_{x_0}^{x_0 + 2h} y\,dx &= h\Big[2y_0 + 2\Delta y_0 + \Big(\frac{8}{3} - 2\Big)\frac{\Delta^2 y_0}{2}\Big]\\
&= h\Big[2y_0 + 2y_1 - 2y_0 + \tfrac{1}{3}(y_2 - 2y_1 + y_0)\Big]\\
&= \frac{h}{3}\,(y_0 + 4y_1 + y_2).
\end{aligned}$$

For the next two intervals from $x_2$ to $x_2 + 2h$ we get in like manner

$$\int_{x_2}^{x_2 + 2h} y\,dx = \frac{h}{3}\,(y_2 + 4y_3 + y_4).$$

Similarly for the third pair of intervals we have

$$\int_{x_4}^{x_4 + 2h} y\,dx = \frac{h}{3}\,(y_4 + 4y_5 + y_6);$$

and so on. Adding all such expressions as these from $x_0$ to $x_n$, where n is even, we get

$$\int_{x_0}^{x_0 + nh} y\,dx = \frac{h}{3}\,(y_0 + 4y_1 + y_2 + y_2 + 4y_3 + y_4 + y_4 + 4y_5 + y_6 + \cdots),$$

or

$$\begin{aligned}
\int_{x_0}^{x_0 + nh} y\,dx &= \frac{h}{3}\,(y_0 + 4y_1 + 2y_2 + 4y_3 + 2y_4 + \cdots + 2y_{n-2} + 4y_{n-1} + y_n)\\
&= \frac{h}{3}\,\big[y_0 + 4(y_1 + y_3 + \cdots + y_{n-1}) + 2(y_2 + y_4 + \cdots + y_{n-2}) + y_n\big]\\
&= \frac{h}{3}\sum c\,y,
\end{aligned}$$

where c = 1, 4, 2, 4, 2, ..., 2, 4, 1.

This important formula is known as Simpson's Rule. It is probably the most useful of all the formulas for mechanical quadrature.

When using this formula the student must bear in mind that the interval of integration must be divided into an even number of subintervals of width h.
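
A minimal Python sketch of Simpson's Rule follows; the function name `simpson` is an assumption of the sketch. As an illustration it is applied to the integral of ln x from 4 to 5.2 with h = 0.2, which is treated as Example 1 later in this chapter.

```python
import math


def simpson(y, h):
    """Simpson's Rule for equally spaced ordinates y[0..n], n even."""
    n = len(y) - 1
    if n < 2 or n % 2 != 0:
        raise ValueError("Simpson's Rule needs an even number of subintervals")
    odd = sum(y[1:n:2])    # y1 + y3 + ... + y_{n-1}, weight 4
    even = sum(y[2:n:2])   # y2 + y4 + ... + y_{n-2}, weight 2
    return h / 3 * (y[0] + 4 * odd + 2 * even + y[n])


h = 0.2
ordinates = [math.log(4.0 + h * i) for i in range(7)]
approx = simpson(ordinates, h)

# Exact value from the antiderivative x (ln x - 1).
exact = 5.2 * (math.log(5.2) - 1) - 4.0 * (math.log(4.0) - 1)
print(f"Simpson: {approx:.8f}   exact: {exact:.8f}")
```

The check on the number of subintervals mirrors the caution in the text: the rule is built from pairs of intervals, so an odd count cannot be handled by this formula alone.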


* Since the interval of integration extends only from $x_0$ to $x_0 + 2h$, there are only the functional values $y_0$, $y_1$, $y_2$ in this interval. Hence with only three values, there can be no differences higher than the second.




The geometric significance of Simpson’s Rule is that we replace the graph 
of the given function by n/2 arcs of second-degree polynomials, or para- 
bolas with vertical axes. 


52. Weddle's Rule. Putting n = 6 in (50.1) and neglecting all differences above the sixth, we have

$$\int_{x_0}^{x_0 + 6h} y\,dx = h\Big[6y_0 + 18\Delta y_0 + 27\Delta^2 y_0 + 24\Delta^3 y_0 + \frac{123}{10}\Delta^4 y_0 + \frac{33}{10}\Delta^5 y_0 + \frac{41}{140}\Delta^6 y_0\Big].$$

Here the coefficient of $\Delta^6 y_0$ differs from 3/10 by the small fraction 1/140. Hence if we replace this coefficient by 3/10, we commit an error of only $\frac{h}{140}\Delta^6 y_0$. If the value of h is such that the sixth differences are small, the error committed will be negligible. We therefore change the last term to $(3/10)\Delta^6 y_0$ and replace all differences by their values in terms of the given y's. The result reduces down to

$$\int_{x_0}^{x_0 + 6h} y\,dx = \frac{3h}{10}\,[y_0 + 5y_1 + y_2 + 6y_3 + y_4 + 5y_5 + y_6].$$

For the next set of six intervals from $x_6$ to $x_{12}$ we get in the same way

$$\int_{x_6}^{x_{12}} y\,dx = \frac{3h}{10}\,[y_6 + 5y_7 + y_8 + 6y_9 + y_{10} + 5y_{11} + y_{12}].$$

Adding all such expressions as these from $x_0$ to $x_n$, where n is now a multiple of six, we get

$$\begin{aligned}
\int_{x_0}^{x_0 + nh} y\,dx = \frac{3h}{10}\,[&y_0 + 5y_1 + y_2 + 6y_3 + y_4 + 5y_5 + 2y_6 + 5y_7 + y_8\\
&+ 6y_9 + y_{10} + 5y_{11} + 2y_{12} + \cdots\\
&+ 2y_{n-6} + 5y_{n-5} + y_{n-4} + 6y_{n-3} + y_{n-2} + 5y_{n-1} + y_n]\\
= \frac{3h}{10}\sum k\,y,&
\end{aligned}$$

where k = 1, 5, 1, 6, 1, 5, 2, 5, 1, 6, 1, 5, 2, etc.

This formula is known as Weddle's Rule. It is more accurate, in general, than Simpson's Rule, but it requires at least seven consecutive values of the function.

The geometric meaning of Weddle's Rule is that we replace the graph of the given function by n/6 arcs of fifth-degree polynomials.
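
Weddle's Rule can be sketched in Python in the same way; the function name `weddle` is an assumption of the sketch. The ordinates used for illustration are the thirteen five-decimal values of y = sin x - ln x + e^x from x = 0.2 to x = 1.4 that are tabulated in Example 2 later in this chapter.

```python
def weddle(y, h):
    """Weddle's Rule for equally spaced ordinates y[0..n], n a multiple of six."""
    n = len(y) - 1
    if n < 6 or n % 6 != 0:
        raise ValueError("Weddle's Rule needs a multiple of six subintervals")
    weights = [1, 5, 1, 6, 1, 5, 1]
    total = 0.0
    # Sum block by block; an ordinate shared by two blocks is counted once
    # in each, which gives it the weight 2 required by the rule.
    for start in range(0, n, 6):
        block = y[start:start + 7]
        total += sum(k * v for k, v in zip(weights, block))
    return 3 * h / 10 * total


ordinates = [3.02951, 2.84936, 2.79754, 2.82130, 2.89759, 3.01465, 3.16605,
             3.34830, 3.55975, 3.80007, 4.06984, 4.37050, 4.70418]
approx = weddle(ordinates, 0.1)
print(f"{approx:.5f}")
```

Summing over blocks of seven ordinates keeps the code close to the derivation above, in which the rule for six intervals is simply repeated and the results added.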



We shall now apply these formulas to two examples, chosen at random.

Example 1. Compute the value of the definite integral

$$\int_4^{5.2} \ln x\,dx.$$

Solution. We divide the interval of integration into six equal parts each of width 0.2. Hence h = 0.2. The values of the function y = ln x are next computed for each point of subdivision. These values are given in the table below.

    x       ln x           x       ln x
   4.0   1.38629436       4.8   1.56861592
   4.2   1.43508453       5.0   1.60943791
   4.4   1.48160454       5.2   1.64865863
   4.6   1.52605630

(a) By Simpson's rule we have

$$I_S = \frac{0.2}{3}\,[3.03495299 + 4(4.57057874) + 2(3.05022046)] = 1.82784726.$$

(b) By Weddle's rule we get

$$I_W = (0.3)(0.2)\,[3.03495299 + 5(3.04452244) + 3.05022046 + 6(1.52605630)] = 1.82784741.$$

The true value of the integral is

$$I = \int_4^{5.2} \ln x\,dx = x(\ln x - 1)\Big]_{4.0}^{5.2} = 1.82784741.$$

Hence the errors are

$$E_S = 0.00000015 = 15 \times 10^{-8}, \qquad E_W = 0.$$


Example 2. Compute the value of the definite integral

$$\int_{0.2}^{1.4} (\sin x - \ln x + e^x)\,dx.$$

Solution. We shall divide the interval of integration into twelve equal parts by taking h = 0.1. The values of the function $y = \sin x - \ln x + e^x$ are then computed for each point of subdivision. These values are given in the table below.

    x       y           x       y
   0.2   3.02951       0.9   3.34830
   0.3   2.84936       1.0   3.55975
   0.4   2.79754       1.1   3.80007
   0.5   2.82130       1.2   4.06984
   0.6   2.89759       1.3   4.37050
   0.7   3.01465       1.4   4.70418
   0.8   3.16605

(a) By Simpson's rule:

$$I_S = \frac{0.1}{3}\,[3.02951 + 4.70418 + 4(20.20418) + 2(16.49077)] = 4.05106.$$

(b) By Weddle's rule:

$$I_W = 0.03\,[21.05841 + 5(13.58281) + 6(6.62137) + 2(3.16605)] = 4.05098.$$

The true value of the integral is

$$I = \int_{0.2}^{1.4} (\sin x - \ln x + e^x)\,dx = 4.05095.$$

Hence the errors are:

$$E_S = 0.00011, \qquad E_W = 0.00003.$$

It will be noted that Weddle's rule is more accurate than Simpson's in both examples.

Although Weddle’s rule is simple in form and very accurate, it has the 
disadvantage of requiring that the number of subdivisions be a multiple 
of six. This means that when computing the values of y in many problems 
the assigned values of x can not be taken as simple tenths, as was done in 
the two examples worked above. The subdivision by tenths is nearly always 
possible when using Simpson’s rule. However, when Simpson’s rule can 
not give the desired degree of accuracy, Weddle’s rule should be used. 

When several values of the function are given, as in the above examples, it is better to make the computation in tabular form, as shown below for Simpson's rule in Ex. 1.

    x       ln x         c      c ln x
   4.0   1.38629436      1    1.38629436
   4.2   1.43508453      4    5.74033812
   4.4   1.48160454      2    2.96320908
   4.6   1.52605630      4    6.10422520
   4.8   1.56861592      2    3.13723184
   5.0   1.60943791      4    6.43775164
   5.2   1.64865863      1    1.64865863
                              -----------
                              27.41770887 × (0.2/3) = 1.82784726.

The reader is cautioned against thinking of quadrature formulas as simply methods of computing areas under curves. These formulas are methods for computing the values of definite integrals. They give areas only when the integrands are the ordinates of a curve. The integrand may be any function of x, provided it is known for equidistant values of x. This fact is illustrated by the following example.

Example 3. Find by Simpson's Rule the coordinates of the centroid and the moment of inertia about the x-axis for the plane area shown below.

Fig. 2

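Solution. The formulas for the centroid give

$$\bar{x} = \frac{\int x\,dA}{\int dA} = \frac{\int_a^b xy\,dx}{\int_a^b y\,dx} = \frac{(h/3)(x_0y_0 + 4x_1y_1 + 2x_2y_2 + \cdots)}{(h/3)(y_0 + 4y_1 + 2y_2 + \cdots)} = \frac{\Sigma cxy}{\Sigma cy},$$

$$\bar{y} = \frac{\int (y/2)\,dA}{\int dA} = \frac{\frac12\int_a^b y^2\,dx}{\int_a^b y\,dx} = \frac{\frac12 (h/3)(y_0^2 + 4y_1^2 + 2y_2^2 + \cdots)}{(h/3)(y_0 + 4y_1 + 2y_2 + \cdots)} = \frac12\,\frac{\Sigma cy^2}{\Sigma cy}.$$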




Since the moment of inertia of a rectangle about its base is $bh^3/3$, the moment of inertia of the elementary area $y\,dx$ about the x-axis is $y^3\,dx/3$. Hence for the whole area we have

$$I_x = \frac13\int_a^b y^3\,dx = \frac13 \times \frac{h}{3}\,(y_0^3 + 4y_1^3 + 2y_2^3 + \cdots) = \frac{h}{9}\,\Sigma cy^3.$$
The computation form for this example is therefore as shown below: 


  x     y     xy      y^2     y^3     c    cy     cxy      cy^2     cy^3
  x0    y0    x0y0    y0^2    y0^3    1    y0     x0y0     y0^2     y0^3
  x1    y1    x1y1    y1^2    y1^3    4    4y1    4x1y1    4y1^2    4y1^3
  x2    y2    x2y2    y2^2    y2^3    2    2y2    2x2y2    2y2^2    2y2^3
  x3    y3    x3y3    y3^2    y3^3    4    4y3    4x3y3    4y3^2    4y3^3
  ...   ...   ...     ...     ...     ...  ...    ...      ...      ...
                                           S1     S2       S3       S4

Hence

$$\bar{x} = S_2/S_1, \qquad \bar{y} = \tfrac12\,S_3/S_1, \qquad I_x = (h/9)\,S_4,$$

where the S's denote the sums of the numbers in the columns above them.

Any other example can be handled in a similar manner after it has been 
set up as a definite integral. 
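The tabular scheme of Example 3 translates directly into a short routine. The sketch below is illustrative only: the function names are hypothetical, and since the ordinates of Fig. 2 are not given in the text, a triangle bounded by $y = x$, the x-axis, and $x = 3$ is assumed as test data (its centroid is at (2, 1) and its moment of inertia about the x-axis is 6.75).

```python
def simpson_sum(values):
    """Sum of c*v with Simpson coefficients c = 1, 4, 2, 4, ..., 4, 1."""
    n = len(values) - 1
    if n % 2:
        raise ValueError("Simpson's rule needs an even number of subintervals")
    coeffs = [1] + [4 if i % 2 else 2 for i in range(1, n)] + [1]
    return sum(c * v for c, v in zip(coeffs, values))

def centroid_and_inertia(xs, ys):
    """Return (x_bar, y_bar, I_x) for the area under equally spaced ordinates ys."""
    h = xs[1] - xs[0]
    s1 = simpson_sum(ys)                                  # S1 = sum of c*y
    s2 = simpson_sum([x * y for x, y in zip(xs, ys)])     # S2 = sum of c*x*y
    s3 = simpson_sum([y ** 2 for y in ys])                # S3 = sum of c*y^2
    s4 = simpson_sum([y ** 3 for y in ys])                # S4 = sum of c*y^3
    return s2 / s1, 0.5 * s3 / s1, h / 9 * s4

# Assumed test data: the triangle under y = x from x = 0 to x = 3, h = 0.5.
xs = [0.5 * i for i in range(7)]
ys = list(xs)
x_bar, y_bar, i_x = centroid_and_inertia(xs, ys)
print(x_bar, y_bar, i_x)
```

Because every integrand here is a polynomial of degree three or lower, Simpson's rule reproduces the exact values for this test case; for measured ordinates the results are approximations.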

52A. The Trapezoidal Rule. The Simpson and Weddle formulas for approximate integration require that the interval of integration be divided into an even number of subintervals. In some applications of numerical integration it is more convenient to divide the interval into an odd number of subintervals. A formula which is applicable to any number of subintervals, whether even or odd, is the Trapezoidal Rule:

(52.1A) $$\int_{x_0}^{x_n} y\,dx = h\left(\tfrac12 y_0 + y_1 + y_2 + \cdots + y_{n-1} + \tfrac12 y_n\right).$$

Inherent error $= -\dfrac{nh^3}{12}\,y''(\xi) = -\dfrac{(x_n - x_0)h^2}{12}\,y''(\xi)$, where $x_0 \le \xi \le x_n$.

When $y_0, y_1, y_2, \cdots, y_n$ are ordinates to a curve $y = f(x)$, formula (52.1A) gives the area under the curve as the sum of the areas of inscribed or circumscribed trapezoids of altitude h and bases $y_0$ and $y_1$, $y_1$ and $y_2$, etc.

In finding the volumes of solids by the trapezoidal formula, the functional values $y_0, y_1, y_2$, etc., would be the cross-sectional areas of the solid at intervals h units apart, and the volume of the solid would be the sum of frustums of thickness h.




The trapezoidal formula can be derived by adding the areas of all trapezoids between $x = a$ and $x = b$; or by putting $n = 1$ in (50.1) and evaluating over single subintervals the first two terms in the right-hand member, all differences higher than the first being dropped because they involve more than two functional values. Thus, for the several subintervals we have:

$$\int_{x_0}^{x_1} y\,dx = h\left(y_0 + \tfrac12\Delta y_0\right) = h\left(y_0 + \frac{y_1 - y_0}{2}\right) = \frac{h}{2}(y_0 + y_1),$$

$$\int_{x_1}^{x_2} y\,dx = h\left(y_1 + \tfrac12\Delta y_1\right) = h\left(y_1 + \frac{y_2 - y_1}{2}\right) = \frac{h}{2}(y_1 + y_2),$$

etc.

Adding these integrals, we have:

$$\int_{x_0}^{x_n} y\,dx = h\left(\tfrac12 y_0 + y_1 + y_2 + \cdots + y_{n-1} + \tfrac12 y_n\right).$$

The trapezoidal rule is the simplest of the formulas for numerical inte- 
gration, but it is also the least accurate. The accuracy of a result can be 
improved by decreasing the interval h. 

Example. The application of the trapezoidal rule to the data of Ex. 1 gives

$$\int_{4}^{5.2} \ln x\,dx = 1.8276551,$$

with an error of 0.0001923.
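As a check on the example above, the trapezoidal rule can be applied to the same data (ln x from x = 4 to x = 5.2 with h = 0.2). This is a minimal sketch with an illustrative function name; the ordinates are recomputed rather than copied from the eight-decimal table.

```python
import math

def trapezoidal(ys, h):
    """Trapezoidal rule for equally spaced ordinates; any number of subintervals."""
    return h * (0.5 * ys[0] + sum(ys[1:-1]) + 0.5 * ys[-1])

h = 0.2
ys = [math.log(4.0 + h * i) for i in range(7)]    # x = 4.0, 4.2, ..., 5.2
approx = trapezoidal(ys, h)
exact = 5.2 * math.log(5.2) - 5.2 - (4.0 * math.log(4.0) - 4.0)
print(f"trapezoidal = {approx:.7f}, error = {exact - approx:.7f}")
```

Halving h should reduce the error by roughly a factor of four, since the inherent error is proportional to the square of h.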


Note. A quadrature formula derived from an algebraic polynomial will not give a reliable result in regions where the graph of the given function is vertical, because an algebraic polynomial is never vertical and therefore cannot be made to coincide with a vertical segment of a curve. A simple and satisfactory way of handling such a case is to replace the curve in the vertical region by a parabola having a horizontal axis. Thus, for the curve shown in Fig. 3 we replace the arc MP by an arc of the parabola $y^2 = 2px$. Then the area of the segment MPN is $(2/3)x_1y_1$.

Fig. 3





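53. Central-Difference Quadrature Formulas. By integrating Stirling's and Bessel's interpolation formulas we can derive rapidly converging quadrature formulas in terms of differences. Thus, integrating Stirling's formula from $x = x_0 - h$ to $x = x_0 + h$, or $u = -1$ to $u = 1$, we have

$$\int_{x_0-h}^{x_0+h} y\,dx = h\int_{-1}^{1}\Big[\,y_0 + u\,\frac{\Delta y_{-1} + \Delta y_0}{2} + \frac{u^2}{2}\,\Delta^2 y_{-1} + \frac{u(u^2-1)}{3!}\,\frac{\Delta^3 y_{-2} + \Delta^3 y_{-1}}{2} + \frac{u^2(u^2-1)}{4!}\,\Delta^4 y_{-2}$$

$$\qquad + \frac{u(u^2-1)(u^2-4)}{5!}\,\frac{\Delta^5 y_{-3} + \Delta^5 y_{-2}}{2} + \frac{u^2(u^2-1)(u^2-4)}{6!}\,\Delta^6 y_{-3}\Big]\,du$$

$$= 2h\left[\,y_0 + \frac16\,\Delta^2 y_{-1} - \frac1{180}\,\Delta^4 y_{-2} + \frac1{1512}\,\Delta^6 y_{-3}\right].$$

This formula gives the approximate value of the integral from $x = x_0 - h$ to $x = x_0 + h$. By advancing the subscripts of the y's by one unit we get the value of the integral from $x = x_0$ to $x = x_0 + 2h$. Denoting this integral by $I_0^2$, we have

$$I_0^2 = 2h\left[\,y_1 + \frac16\,\Delta^2 y_0 - \frac1{180}\,\Delta^4 y_{-1} + \frac1{1512}\,\Delta^6 y_{-2}\right].$$

The integrals $I_2^4, I_4^6, \cdots, I_{n-2}^n$ are likewise seen to be

$$I_2^4 = 2h\left[\,y_3 + \frac16\,\Delta^2 y_2 - \frac1{180}\,\Delta^4 y_1 + \frac1{1512}\,\Delta^6 y_0\right],$$

$$I_4^6 = 2h\left[\,y_5 + \frac16\,\Delta^2 y_4 - \frac1{180}\,\Delta^4 y_3 + \frac1{1512}\,\Delta^6 y_2\right],$$

$$\cdots$$

$$I_{n-2}^n = 2h\left[\,y_{n-1} + \frac16\,\Delta^2 y_{n-2} - \frac1{180}\,\Delta^4 y_{n-3} + \frac1{1512}\,\Delta^6 y_{n-4}\right].$$

Adding all these separate integrals, we get

(53.1) $$I_0^n = 2h\Big[\,y_1 + y_3 + y_5 + \cdots + y_{n-1} + \frac16\,(\Delta^2 y_0 + \Delta^2 y_2 + \cdots + \Delta^2 y_{n-2})$$

$$\qquad - \frac1{180}\,(\Delta^4 y_{-1} + \Delta^4 y_1 + \cdots + \Delta^4 y_{n-3}) + \frac1{1512}\,(\Delta^6 y_{-2} + \Delta^6 y_0 + \Delta^6 y_2 + \cdots + \Delta^6 y_{n-4})\Big],$$

where

$$I_0^n = \int_{x_0}^{x_n} y\,dx$$

and n is even.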


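Integrating Bessel's formula (VI) over the interval $x = x_0$ to $x = x_0 + h$, or $v = -\tfrac12$ to $v = \tfrac12$, we have

$$I_0^1 = h\int_{-1/2}^{1/2}\Big[\,\frac{y_0 + y_1}{2} + v\,\Delta y_0 + \frac{v^2 - \frac14}{2}\,\frac{\Delta^2 y_{-1} + \Delta^2 y_0}{2} + \frac{v(v^2 - \frac14)}{3!}\,\Delta^3 y_{-1} + \frac{(v^2 - \frac14)(v^2 - \frac94)}{4!}\,\frac{\Delta^4 y_{-2} + \Delta^4 y_{-1}}{2}$$

$$\qquad + \frac{v(v^2 - \frac14)(v^2 - \frac94)}{5!}\,\Delta^5 y_{-2} + \frac{(v^2 - \frac14)(v^2 - \frac94)(v^2 - \frac{25}{4})}{6!}\,\frac{\Delta^6 y_{-3} + \Delta^6 y_{-2}}{2}\Big]\,dv$$

$$= h\left[\,\frac{y_0 + y_1}{2} - \frac1{12}\,\frac{\Delta^2 y_{-1} + \Delta^2 y_0}{2} + \frac{11}{720}\,\frac{\Delta^4 y_{-2} + \Delta^4 y_{-1}}{2} - \frac{191}{60480}\,\frac{\Delta^6 y_{-3} + \Delta^6 y_{-2}}{2}\right].$$

By advancing the subscripts a unit at a time we find the integrals over the succeeding intervals to be

$$I_1^2 = h\left[\,\frac{y_1 + y_2}{2} - \frac1{12}\,\frac{\Delta^2 y_0 + \Delta^2 y_1}{2} + \frac{11}{720}\,\frac{\Delta^4 y_{-1} + \Delta^4 y_0}{2} - \frac{191}{60480}\,\frac{\Delta^6 y_{-2} + \Delta^6 y_{-1}}{2}\right],$$

$$I_2^3 = h\left[\,\frac{y_2 + y_3}{2} - \frac1{12}\,\frac{\Delta^2 y_1 + \Delta^2 y_2}{2} + \frac{11}{720}\,\frac{\Delta^4 y_0 + \Delta^4 y_1}{2} - \frac{191}{60480}\,\frac{\Delta^6 y_{-1} + \Delta^6 y_0}{2}\right],$$

$$\cdots$$

$$I_{n-1}^n = h\left[\,\frac{y_{n-1} + y_n}{2} - \frac1{12}\,\frac{\Delta^2 y_{n-2} + \Delta^2 y_{n-1}}{2} + \frac{11}{720}\,\frac{\Delta^4 y_{n-3} + \Delta^4 y_{n-2}}{2} - \frac{191}{60480}\,\frac{\Delta^6 y_{n-4} + \Delta^6 y_{n-3}}{2}\right].$$

Adding all these separate integrals, we get

(53.2) $$I_0^n = h\Big[\,\frac{y_0}{2} + y_1 + y_2 + \cdots + y_{n-1} + \frac{y_n}{2} - \frac1{12}\left(\frac{\Delta^2 y_{-1}}{2} + \Delta^2 y_0 + \Delta^2 y_1 + \cdots + \Delta^2 y_{n-2} + \frac{\Delta^2 y_{n-1}}{2}\right)$$

$$\qquad + \frac{11}{720}\left(\frac{\Delta^4 y_{-2}}{2} + \Delta^4 y_{-1} + \Delta^4 y_0 + \cdots + \Delta^4 y_{n-3} + \frac{\Delta^4 y_{n-2}}{2}\right)$$

$$\qquad - \frac{191}{60480}\left(\frac{\Delta^6 y_{-3}}{2} + \Delta^6 y_{-2} + \Delta^6 y_{-1} + \cdots + \Delta^6 y_{n-4} + \frac{\Delta^6 y_{n-3}}{2}\right)\Big],$$

where n is now either even or odd.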


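It will be observed that formulas (53.1) and (53.2) involve only differences of even orders, that (53.2) involves all the even differences, whereas (53.1) involves only half of them. Formula (53.2) is too cumbersome for practical use as it stands, but it can be transformed into a much simpler and more useful form, as we shall now show.

From the definition of differences we have

$$\Delta^2 y_{-1} = \Delta y_0 - \Delta y_{-1}, \qquad \Delta^2 y_0 = \Delta y_1 - \Delta y_0, \qquad \cdots, \qquad \Delta^2 y_{n-1} = \Delta y_n - \Delta y_{n-1},$$

$$\Delta^4 y_{-2} = \Delta^3 y_{-1} - \Delta^3 y_{-2}, \qquad \Delta^4 y_{-1} = \Delta^3 y_0 - \Delta^3 y_{-1}, \qquad \cdots, \qquad \Delta^4 y_{n-2} = \Delta^3 y_{n-1} - \Delta^3 y_{n-2},$$

$$\Delta^6 y_{-3} = \Delta^5 y_{-2} - \Delta^5 y_{-3}, \qquad \Delta^6 y_{-2} = \Delta^5 y_{-1} - \Delta^5 y_{-2}, \qquad \cdots, \qquad \Delta^6 y_{n-3} = \Delta^5 y_{n-2} - \Delta^5 y_{n-3}, \quad \text{etc.}$$

Substituting in (53.2) these values of the even differences, we find that all differences except those at the beginning and end of the table cancel one another and that formula (53.2) reduces to

$$I_0^n = h\Big[\,\frac{y_0}{2} + y_1 + y_2 + \cdots + y_{n-1} + \frac{y_n}{2} + \frac1{12}\left(\frac{\Delta y_{-1} + \Delta y_0}{2}\right) - \frac{11}{720}\left(\frac{\Delta^3 y_{-2} + \Delta^3 y_{-1}}{2}\right) + \frac{191}{60480}\left(\frac{\Delta^5 y_{-3} + \Delta^5 y_{-2}}{2}\right)$$

$$\qquad - \frac1{12}\left(\frac{\Delta y_{n-1} + \Delta y_n}{2}\right) + \frac{11}{720}\left(\frac{\Delta^3 y_{n-2} + \Delta^3 y_{n-1}}{2}\right) - \frac{191}{60480}\left(\frac{\Delta^5 y_{n-3} + \Delta^5 y_{n-2}}{2}\right)\Big],$$

which can be written in the simpler form

(53.3) $$I_0^n = h\Big[\,\frac{y_0}{2} + y_1 + y_2 + \cdots + y_{n-1} + \frac{y_n}{2} - \frac1{12}\left(\frac{\Delta y_{n-1} + \Delta y_n}{2} - \frac{\Delta y_{-1} + \Delta y_0}{2}\right)$$

$$\qquad + \frac{11}{720}\left(\frac{\Delta^3 y_{n-2} + \Delta^3 y_{n-1}}{2} - \frac{\Delta^3 y_{-2} + \Delta^3 y_{-1}}{2}\right) - \frac{191}{60480}\left(\frac{\Delta^5 y_{n-3} + \Delta^5 y_{n-2}}{2} - \frac{\Delta^5 y_{-3} + \Delta^5 y_{-2}}{2}\right)\Big].$$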


The results given by this formula are identical with those given by (53.2), but the labor involved in obtaining them is only a small fraction of that required when using (53.2).

The geometric significance of formulas (53.1), (53.2), and (53.3) should be noted. Formula (53.1) replaces the graph of the given function by n/2 arcs of polynomials of the sixth degree, whereas (53.2) and (53.3) replace the graph by n arcs of sixth-degree polynomials.

By neglecting fourth and sixth differences in (53.1) and replacing the second differences by their values in terms of the y's, we shall find that (53.1) then reduces to Simpson's Rule. This formula therefore represents Simpson's Rule with correction terms.

We shall now apply (53.1) and (53.3) to two examples.

Example 1. Compute the value of $\pi$ from the formula

$$\frac{\pi}{4} = \int_0^1 \frac{dx}{1 + x^2}.$$

Solution. We first compute the values of the function $y = 1/(1 + x^2)$ from $x = -0.3$ to $x = 1.3$, taking $h = 0.1$, and then form a table of differences.

Substituting in (53.1) the appropriate differences, we have

$$\frac{\pi}{4} = 0.2\left[3.9311573 + \frac16(-0.0249992) - \frac1{180}(-0.0000007) + \frac1{1512}(0.0000778)\right] = 0.78539816.$$

$$\therefore\ \pi = 4 \times 0.78539816 = 3.14159264.$$

The true value of $\pi$ to nine figures is

$$\pi = 3.14159265.$$


Difference Table for $y = 1/(1 + x^2)$.

The differences are given in units of the seventh decimal place. With $x_0 = 0$ and $x_k = 0.1k$, the entries in the row for $x_k$ are, from left to right, $\Delta y_k$, $\Delta^2 y_{k-1}$, $\Delta^3 y_{k-1}$, $\Delta^4 y_{k-2}$, $\Delta^5 y_{k-2}$, $\Delta^6 y_{k-3}$.

   x        y          Δy       Δ²y      Δ³y      Δ⁴y     Δ⁵y     Δ⁶y
 -0.3   0.9174312    441073
 -0.2   0.9615385    285605   -155468   -31127
 -0.1   0.9900990     99010   -186595   -11425    19702    3148
  0     1.0000000    -99010   -198020    11425    22850   -3148   -6296
  0.1   0.9900990   -285605   -186595    31127    19702   -7910   -4762
  0.2   0.9615385   -441073   -155468    42919    11792   -9230   -1320
  0.3   0.9174312   -553622   -112549    45481     2562   -7344    1886
  0.4   0.8620690   -620690    -67068    40699    -4782   -4021    3323
  0.5   0.8000000   -647059    -26369    31896    -8803    -936    3085
  0.6   0.7352941   -641532      5527    22157    -9739    1047    1983
  0.7   0.6711409   -613848     27684    13465    -8692    1915     868
  0.8   0.6097561   -572699     41149     6688    -6777    2001      86
  0.9   0.5524862   -524862     47837     1912    -4776    1702    -299
  1.0   0.5000000   -475113     49749    -1162    -3074    1286    -416
  1.1   0.4524887   -426526     48587    -2950    -1788
  1.2   0.4098361   -380889     45637
  1.3   0.3717472


Substituting in (53.3) the appropriate differences from the table, we get

$$\frac{\pi}{4} = 0.1\left[7.8498150 - \frac1{12}(-0.0499988) + \frac{11}{720}(0.0000375) - \frac{191}{60480}(0.0001494)\right] = 0.78539817.$$

$$\therefore\ \pi = 4 \times 0.78539817 = 3.14159268.$$

This value is slightly less accurate than that obtained by (53.1), but either result is correct to as many figures as were used in the computed ordinates.

Simpson's Rule gives for this problem the value

$$3.14159260,$$

which is likewise correct to as many figures as are given in the computed ordinates.
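Formulas (53.1) and (53.3) are tedious by hand because of the difference table, but they are short to program. The sketch below is an illustration under stated assumptions: the helper names are hypothetical, each difference is formed directly from the ordinates by the binomial expansion $\Delta^k y_j = \sum_i (-1)^{k-i}\binom{k}{i}\,y_{j+i}$ instead of from a stored table, and the ordinates are computed in floating point, so the results agree with the hand values only to about the seventh decimal.

```python
from math import comb, pi

def delta(y, k, j):
    """k-th forward difference of y at index j, where y(j) returns the ordinate y_j."""
    return sum((-1) ** (k - i) * comb(k, i) * y(j + i) for i in range(k + 1))

def formula_53_1(f, x0, h, n):
    """Integral from x0 to x0 + n*h by (53.1); n must be even."""
    if n % 2:
        raise ValueError("(53.1) needs an even number of subintervals")
    y = lambda j: f(x0 + j * h)
    odd = range(1, n, 2)
    total = sum(y(j) for j in odd)
    total += sum(delta(y, 2, j - 1) for j in odd) / 6
    total -= sum(delta(y, 4, j - 2) for j in odd) / 180
    total += sum(delta(y, 6, j - 3) for j in odd) / 1512
    return 2 * h * total

def formula_53_3(f, x0, h, n):
    """Integral from x0 to x0 + n*h by (53.3); n may be even or odd."""
    y = lambda j: f(x0 + j * h)
    mean = lambda k, j: 0.5 * (delta(y, k, j) + delta(y, k, j + 1))
    total = 0.5 * y(0) + sum(y(j) for j in range(1, n)) + 0.5 * y(n)
    total -= (mean(1, n - 1) - mean(1, -1)) / 12
    total += 11 * (mean(3, n - 2) - mean(3, -2)) / 720
    total -= 191 * (mean(5, n - 3) - mean(5, -3)) / 60480
    return h * total

f = lambda x: 1.0 / (1.0 + x * x)
print("pi by (53.1):", 4 * formula_53_1(f, 0.0, 0.1, 10))
print("pi by (53.3):", 4 * formula_53_3(f, 0.0, 0.1, 10))
print("true value:  ", pi)
```

Both routines evaluate the function at points outside the range of integration (three subintervals beyond each end), exactly as the hand computation requires ordinates from x = -0.3 to x = 1.3.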

Example 2. Compute the approximate value of the integral

$$I = \int_1^2 \frac{dx}{x}.$$

Solution. Taking $h = 0.1$, we compute the values of $y = 1/x$ at one-tenth unit intervals from $x = 0.7$ to $x = 2.3$ and form a table of differences.

Substituting in (53.1) the appropriate differences, we get

$$I = 0.2\left[3.45953943 + \frac16(0.03727034) - \frac1{180}(0.00281353) + \frac1{1512}(0.00061266)\right] = 0.693147185.$$

The correct value is $\ln 2 = 0.693147181$.

Substituting in (53.3) the appropriate differences from the table, we get

$$I = 0.1\left[6.93771403 - \frac1{12}(0.07594744) + \frac{11}{720}(0.00593339) - \frac{191}{60480}(0.00136810)\right] = 0.69314714.$$

It will be seen that formula (53.1) gave the more accurate value in this example, as was the case in the preceding.

Difference Table for $y = 1/x$.

The differences are given in units of the eighth decimal place. With $x_0 = 1.0$ and $x_k = 1.0 + 0.1k$, the entries in the row for $x_k$ are, from left to right, $\Delta y_k$, $\Delta^2 y_{k-1}$, $\Delta^3 y_{k-1}$, $\Delta^4 y_{k-2}$, $\Delta^5 y_{k-2}$, $\Delta^6 y_{k-3}$.

   x        y            Δy         Δ²y        Δ³y       Δ⁴y      Δ⁵y      Δ⁶y
  0.7   1.42857143   -17857143
  0.8   1.25000000   -13888889    3968254   -1190476
  0.9   1.11111111   -11111111    2777778    -757576    432900   -180375
  1.0   1.00000000    -9090909    2020202    -505051    252525    -97123    83252
  1.1   0.90909091    -7575758    1515151    -349649    155402    -55505    41618
  1.2   0.83333333    -6410256    1165502    -249752     99897    -33293    22212
  1.3   0.76923077    -5494506     915750    -183148     66604    -20821    12472
  1.4   0.71428571    -4761904     732602    -137365     45783    -13459     7362
  1.5   0.66666667    -4166667     595237    -105041     32324     -8981     4478
  1.6   0.62500000    -3676471     490196     -81698     23343     -6147     2834
  1.7   0.58823529    -3267973     408498     -64502     17196     -4292     1855
  1.8   0.55555556    -2923977     343996     -51598     12904     -3077     1215
  1.9   0.52631579    -2631579     292398     -41771      9827     -2234      843
  2.0   0.50000000    -2380952     250627     -34178      7593     -1645      589
  2.1   0.47619048    -2164503     216449     -28230      5948
  2.2   0.45454545    -1976284     188219
  2.3   0.43478261

Concerning the relative merits of formulas (53.1) and (53.3), it may be said that (53.1) converges more rapidly and is therefore slightly more accurate. It utilizes fewer ordinates outside the range of integration than does (53.3). Formula (53.1) requires that the number of subintervals be even and also requires a little more labor in its application than does (53.3).

Formula (53.3) has the advantage of being applicable to any number of subintervals and of requiring very little labor in its application. It gives the same degree of accuracy with third or fifth differences as (53.1) gives with fourth or sixth differences. Its chief disadvantage is that it utilizes several ordinates outside the range of integration.

The extra-interval ordinates required in formulas (53.1) and (53.3) can usually be found by computation, as in the examples worked above, or by extrapolation by means of Newton's formulas (I) and (II). Usually, however, it is not safe to use extrapolation for finding more than one ordinate at each end of the range.

54. Gauss's Quadrature Formula. The most accurate of the quadrature formulas in ordinary use is known as Gauss's formula. In Simpson's and Weddle's formulas the ordinates are equally spaced, but it occurred to Gauss that some other spacing might give a better result. Hence he set for himself this problem:

If the definite integral $\int_a^b f(x)\,dx$ is to be computed from a given number of values of $f(x)$, just where should these values be taken in order to get a result of the greatest possible accuracy? In other words, how shall the interval $(a, b)$ be subdivided so as to give the best possible result?

It turns out that the points of subdivision should not be equidistant, but they are symmetrically placed with respect to the midpoint of the interval of integration.

Let $I = \int_a^b y\,dx$ denote the integral to be computed, where $y = f(x)$. On changing the variable by the substitution

(1) $$x = (b - a)u + \frac{a + b}{2},$$

the limits of integration become $-\tfrac12$ and $\tfrac12$. The new value of y is

$$y = f(x) = f\left[(b - a)u + \frac{a + b}{2}\right] = \phi(u), \text{ say.}$$

Then since $dx = (b - a)\,du$, the integral becomes

(2) $$I = (b - a)\int_{-1/2}^{1/2} \phi(u)\,du.$$

Gauss's formula is

(54.1) $$I = \int_{-1/2}^{1/2} \phi(u)\,du = R_1\phi(u_1) + R_2\phi(u_2) + R_3\phi(u_3) + \cdots + R_n\phi(u_n),$$

where $u_1, u_2, \cdots, u_n$ are the points of subdivision of the interval $u = -\tfrac12$ to $u = \tfrac12$. The corresponding values of x are therefore

$$x_1 = (b - a)u_1 + \frac{a + b}{2}, \qquad x_2 = (b - a)u_2 + \frac{a + b}{2}, \quad \text{etc.}$$

The value of the integral $\int_a^b f(x)\,dx$ is therefore

(54.2) $$I = \int_a^b f(x)\,dx = (b - a)\,[R_1\phi(u_1) + R_2\phi(u_2) + \cdots + R_n\phi(u_n)].$$

We shall not give a detailed derivation of Gauss's formula (54.1), but merely show how the values of $u_1, u_2, \cdots, u_n$ and $R_1, R_2, \cdots, R_n$ are found and then show how to apply it to an example.

We assume that $\phi(u)$ can be expanded in a convergent power series in the interval $u = -\tfrac12$ to $u = \tfrac12$. Hence we write

(3) $$\phi(u) = a_0 + a_1u + a_2u^2 + a_3u^3 + \cdots + a_mu^m + \cdots$$

We also assume that the integral can be expressed as a linear function of the ordinates of the form (54.1). Integrating (3) between the limits $-\tfrac12$ and $\tfrac12$, we have

(4) $$I = \int_{-1/2}^{1/2} \phi(u)\,du = \int_{-1/2}^{1/2} (a_0 + a_1u + a_2u^2 + \cdots + a_mu^m + \cdots)\,du = a_0 + \frac1{12}a_2 + \frac1{80}a_4 + \frac1{448}a_6 + \cdots$$

From (3) we also have

$$\phi(u_1) = a_0 + a_1u_1 + a_2u_1^2 + a_3u_1^3 + a_4u_1^4 + \cdots + a_mu_1^m + \cdots,$$

$$\phi(u_2) = a_0 + a_1u_2 + a_2u_2^2 + a_3u_2^3 + a_4u_2^4 + \cdots + a_mu_2^m + \cdots,$$

$$\cdots$$

$$\phi(u_n) = a_0 + a_1u_n + a_2u_n^2 + a_3u_n^3 + a_4u_n^4 + \cdots + a_mu_n^m + \cdots$$

Substituting in (54.1) these values of $\phi(u_1), \phi(u_2), \cdots, \phi(u_n)$, we get

$$I = R_1(a_0 + a_1u_1 + a_2u_1^2 + \cdots + a_mu_1^m + \cdots)$$

$$\quad + R_2(a_0 + a_1u_2 + a_2u_2^2 + \cdots + a_mu_2^m + \cdots)$$

$$\quad + \cdots$$

$$\quad + R_n(a_0 + a_1u_n + a_2u_n^2 + \cdots + a_mu_n^m + \cdots),$$

or, rearranging,

(5) $$I = a_0(R_1 + R_2 + R_3 + \cdots + R_n)$$

$$\quad + a_1(R_1u_1 + R_2u_2 + \cdots + R_nu_n)$$

$$\quad + a_2(R_1u_1^2 + R_2u_2^2 + \cdots + R_nu_n^2)$$

$$\quad + \cdots$$

$$\quad + a_m(R_1u_1^m + R_2u_2^m + \cdots + R_nu_n^m) + \cdots$$

Now if the integral I in (5) is to be identically the same as the I in (4) for all values of $a_0$, $a_1$, etc.; that is, if (5) is to be identical with (4) regardless of the form of the function $\phi(u)$, then corresponding coefficients of $a_0$, $a_1$, $a_2$, etc. in (5) and (4) must be equal. Hence we must have

(54.3) $$\begin{cases} R_1 + R_2 + R_3 + \cdots + R_n = 1, \\ R_1u_1 + R_2u_2 + R_3u_3 + \cdots + R_nu_n = 0, \\ R_1u_1^2 + R_2u_2^2 + R_3u_3^2 + \cdots + R_nu_n^2 = \dfrac1{12}, \\ R_1u_1^3 + R_2u_2^3 + R_3u_3^3 + \cdots + R_nu_n^3 = 0, \\ R_1u_1^4 + R_2u_2^4 + R_3u_3^4 + \cdots + R_nu_n^4 = \dfrac1{80}, \\ \cdots \end{cases}$$


By taking 2n of these equations and solving them simultaneously, it would be theoretically possible to find the 2n quantities $u_1, u_2, \cdots, u_n$ and $R_1, R_2, \cdots, R_n$. However, the labor of solving these equations by the ordinary methods of algebra would be quite prohibitive even for small values of n. Fortunately a formula from higher mathematics makes such labor unnecessary.

It can be shown * without difficulty that if $\phi(u)$ is a polynomial of degree not higher than $2n - 1$, then $u_1, u_2, \cdots, u_n$ are the zeros of the Legendre polynomial $P_n(u)$, or the roots of $P_n(u) = 0$. These roots are conveniently found from the equation

(6) $$\frac{d^n}{du^n}\left[\left(u^2 - \tfrac14\right)^n\right] = 0.$$

The n roots $u_1, u_2, \cdots, u_n$ of this nth-degree equation are all real. On substituting them in (54.3), we can find the R's. We shall do this for the case $n = 3$.

The equation to be solved is

$$\frac{d^3}{du^3}\left[\left(u^2 - (\tfrac12)^2\right)^3\right] = 0, \quad\text{or}\quad \frac{d^3}{du^3}\left[\left(u^2 - \tfrac14\right)^3\right] = 0.$$

Performing the differentiations and simplifying, we get $u(20u^2 - 3) = 0$, from which

$$u = 0, \ \pm\tfrac12\sqrt{3/5}.$$

Hence

$$u_1 = -\tfrac12\sqrt{3/5}, \qquad u_2 = 0, \qquad u_3 = \tfrac12\sqrt{3/5}.$$

* See, for example, Todhunter's Functions of Laplace, Lamé, and Bessel, p. 99.

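Then from the first three of equations (54.3), we have

$$R_1 + R_2 + R_3 = 1,$$

$$R_1\left(-\tfrac12\sqrt{3/5}\right) + R_2(0) + R_3\left(\tfrac12\sqrt{3/5}\right) = 0,$$

$$R_1\left(-\tfrac12\sqrt{3/5}\right)^2 + R_3\left(\tfrac12\sqrt{3/5}\right)^2 = \frac1{12}.$$

Solving these equations, we find

$$R_1 = 5/18, \qquad R_2 = 4/9, \qquad R_3 = 5/18.$$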
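For larger n the points and coefficients are more easily found by machine than by solving (54.3) by hand. The following sketch is an assumption-laden illustration, not the method of the text: it locates the zeros of the ordinary Legendre polynomial on the interval -1 to 1 by Newton's method, uses the standard weight formula $w = 2/[(1 - t^2)\,P_n'(t)^2]$ for that interval, and then halves both the zeros and the weights so that they refer to the interval $u = -\tfrac12$ to $u = \tfrac12$ used here. The function names are hypothetical.

```python
import math

def legendre_with_derivative(n, t):
    """Return (P_n(t), P_n'(t)) for n >= 1 and |t| < 1 by the three-term recurrence."""
    p_prev, p = 1.0, t
    for k in range(2, n + 1):
        p_prev, p = p, ((2 * k - 1) * t * p - (k - 1) * p_prev) / k
    dp = n * (t * p - p_prev) / (t * t - 1.0)
    return p, dp

def gauss_points(n):
    """Sorted list of (u_i, R_i) for the interval u = -1/2 to u = 1/2."""
    result = []
    for i in range(1, n + 1):
        t = math.cos(math.pi * (i - 0.25) / (n + 0.5))   # starting guess on (-1, 1)
        for _ in range(100):                               # Newton's method on P_n(t) = 0
            p, dp = legendre_with_derivative(n, t)
            step = p / dp
            t -= step
            if abs(step) < 1e-15:
                break
        _, dp = legendre_with_derivative(n, t)
        weight = 2.0 / ((1.0 - t * t) * dp * dp)           # weight on (-1, 1)
        result.append((t / 2.0, weight / 2.0))             # halve both for (-1/2, 1/2)
    return sorted(result)

for u, r in gauss_points(3):
    print(f"u = {u:+.10f}   R = {r:.10f}")
```

For n = 3 the printed values should reproduce $\pm\tfrac12\sqrt{3/5}$ with R = 5/18 and the midpoint with R = 4/9, as found above.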

It is to be noted that the u's are symmetrically placed with respect to the midpoint of the interval of integration and that the R's are the same for each symmetric pair of u's. Hence from now on the u's of the points of division will be designated by $u_0$ for the midpoint, $u_{\pm1}$ for the pair of symmetric points nearest the midpoint, $u_{\pm2}$ for the next pair of symmetric points, etc.

The numerical values of the u's and corresponding R's for $n = 2$ to $n = 10$ are given in the table below, where the notation of the form $u_{\pm k} = N$ means $u_k = N$, $u_{-k} = -N$.

             u                            R
 n = 2.    u_{±1} = 0.2886751346        R = 1/2
 n = 3.    u_0    = 0                   R = 4/9
           u_{±1} = 0.3872983346        R = 5/18
 n = 4.    u_{±1} = 0.1699905218        R = 0.3260725774
           u_{±2} = 0.4305681558        R = 0.1739274226
 n = 5.    u_0    = 0                   R = 64/225
           u_{±1} = 0.2692346551        R = 0.2393143352
           u_{±2} = 0.4530899230        R = 0.1184634425
 n = 6.    u_{±1} = 0.1193095930        R = 0.2339569673
           u_{±2} = 0.3306046932        R = 0.1803807865
           u_{±3} = 0.4662347571        R = 0.08566224619
 n = 7.    u_0    = 0                   R = 0.2089795918
           u_{±1} = 0.2029225757        R = 0.1909150253
           u_{±2} = 0.3707655928        R = 0.1398526957
           u_{±3} = 0.4745539562        R = 0.06474248308
 n = 8.    u_{±1} = 0.0917173212        R = 0.1813418917
           u_{±2} = 0.2627662050        R = 0.1568533229
           u_{±3} = 0.3983332387        R = 0.1111905172
           u_{±4} = 0.4801449282        R = 0.05061426815
 n = 9.    u_0    = 0                   R = 0.1651196775
           u_{±1} = 0.1621267117        R = 0.1561735385
           u_{±2} = 0.3066857164        R = 0.1303053482
           u_{±3} = 0.4180155537        R = 0.09032408035
           u_{±4} = 0.4840801198        R = 0.04063719418
 n = 10.   u_{±1} = 0.0744371695        R = 0.1477621124
           u_{±2} = 0.2166976971        R = 0.1346333597
           u_{±3} = 0.3397047841        R = 0.1095431813
           u_{±4} = 0.4325316833        R = 0.07472567458
           u_{±5} = 0.4869532643        R = 0.03333567215

Note. For further tables relating to Gauss's quadrature formula the reader should consult the following literature:

1. "Table of the Zeros of the Legendre Polynomials of Order 1-16 and the Weight Coefficients for Gauss's Mechanical Quadrature Formula," by A. N. Lowan, Norman Davids, and Arthur Levinson, Bulletin of The American Mathematical Society, Vol. 48, No. 10, pp. 739-743, October 1942.

This is the most extensive table of the zeros and Gauss coefficients that has yet been published. All numbers are given to 15 decimal places. This table gives the subdivision points for the interval $u = -1$ to $u = 1$, and the values of u and R must be divided by 2 to agree with those given in the table above.

2. Valeur Approximative d'une Intégrale Définie, by B. P. Moors, Paris, 1905. This is the most comprehensive work on approximate quadrature that has ever been written. The roots and coefficients for the Gauss formula are given for $n = 1$ to $n = 10$ to 16 decimal places. The complete formula and its graphic representation are also given for each value of n.

We shall now apply Gauss's formula to a simple example.





Example. Compute the integral

$$I = \int_5^{12} \frac{dx}{x}.$$

Solution. Here we put

$$x = (b - a)u + \frac{a + b}{2} = 7u + 8.5, \qquad y = \frac1x = \frac1{7u + 8.5} = \phi(u).$$

Taking $n = 5$, we have

$$y_0 = \phi(u_0) = \frac1{8.5} = 0.117647059,$$

$$y_1 = \phi(u_1) = \frac1{7u_1 + 8.5} = \frac1{10.3846426} = 0.0962960439,$$

$$y_{-1} = \phi(u_{-1}) = \frac1{7u_{-1} + 8.5} = \frac1{6.61535741} = 0.151163412,$$

$$y_2 = \phi(u_2) = \frac1{7u_2 + 8.5} = \frac1{11.67162946} = 0.0856778399,$$

$$y_{-2} = \phi(u_{-2}) = \frac1{7u_{-2} + 8.5} = \frac1{5.32837054} = 0.187674636.$$

Substituting these values in (54.2), together with the corresponding R's for $n = 5$, we get

$$I = 7\left[\tfrac{64}{225} \times 0.117647059 + 0.2393143352\,(0.151163412 + 0.0962960439) + 0.1184634425\,(0.187674636 + 0.0856778399)\right],$$

or

$$I_G = 0.875468458.$$

The true value of the integral is

$$I = \int_5^{12} \frac{dx}{x} = \ln 2.4 = 0.875468737.$$

The error is therefore

$$E_G = 0.00000028.$$

The value of this integral by Simpson's Rule, using fifteen ordinates, is

$$I_S = 0.87547189.$$

The error in this case is therefore

$$E_S = -0.0000032,$$





or more than ten times as great as with Gauss’s formula. The labor 
required to find the integral by Gauss’s formula is, however, many times 
as great as with Simpson’s unless a computing machine is used. 
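With a computing machine the labor is slight. The sketch below applies the five-point formula (54.2) with the u's and R's for n = 5 from the table above; the names `GAUSS_5` and `gauss5` are illustrative and not taken from the text.

```python
import math

# (u, R) for n = 5 on the interval u = -1/2 to u = 1/2; each nonzero u stands for +u and -u.
GAUSS_5 = [
    (0.0, 64 / 225),
    (0.2692346551, 0.2393143352),
    (0.4530899230, 0.1184634425),
]

def gauss5(f, a, b):
    """Five-point Gauss formula (54.2) for the integral of f from a to b."""
    mid = (a + b) / 2
    length = b - a
    total = 0.0
    for u, r in GAUSS_5:
        if u == 0.0:
            total += r * f(mid)
        else:
            # x = (b - a)u + (a + b)/2 for the symmetric pair +u and -u
            total += r * (f(mid + length * u) + f(mid - length * u))
    return length * total

value = gauss5(lambda x: 1.0 / x, 5.0, 12.0)
print(f"I_G = {value:.9f}, error = {math.log(2.4) - value:.8f}")
```

Only five evaluations of the integrand are needed, against fifteen for the Simpson computation quoted above.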


Note. The distance in x-units of any point $u_i$ from the midpoint of the interval of integration is $(b - a)u_i$.

To prove this, let $b - a = L$ and take the midpoint of the interval as origin of coordinates. Then the limits of integration for x are $-L/2$ and $L/2$. On substituting $-L/2$ for a and $L/2$ for b in the transformation formula

$$x = (b - a)u + \frac{a + b}{2},$$

we get

$$x = Lu = (b - a)u.$$

Gauss's formula is useful for another purpose besides computing definite integrals. Recalling that the mean value of a function is given by the formula

$$\bar{y} = \frac1{b - a}\int_a^b y\,dx,$$

we see that the accuracy of the mean depends on the accuracy with which the integral $\int_a^b y\,dx$ can be computed. The most accurate value of this is obtained by measuring ordinates at the points given by Gauss's formula.

Thus, if we wished to find the best value for the mean daily temperature from only four measurements, we would proceed as follows:

Denoting temperature by T, the hour of the day by t, and taking noon as the middle of the day, we have

$$T = f(t), \qquad \bar{T} = \frac1{24}\int_{-12}^{12} f(t)\,dt.$$

Taking noon as the midpoint of the 24-hour period and remembering that the time measured from noon is $24u$, we have for $n = 4$,

$$t_1 = 24u_1 = 24(0.16999) = 4.0798 \text{ hr} = 4 \text{ hr } 4.8 \text{ min},$$

$$t_{-1} = 24u_{-1} = 24(-0.16999) = -4 \text{ hr } 4.8 \text{ min},$$

$$t_2 = 24u_2 = 24(0.430568) = 10.3336 \text{ hr} = 10 \text{ hr } 20 \text{ min},$$

$$t_{-2} = 24u_{-2} = 24(-0.430568) = -10 \text{ hr } 20 \text{ min}.$$

The best times during the day to take measurements are therefore 1:40 A.M., 7:55 A.M., 4:05 P.M., and 10:20 P.M.
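The conversion from Gauss points to clock times can be written out as follows. This is a small illustrative sketch: the function names are hypothetical, times are rounded to the nearest minute, and a 24-hour clock is used for the output.

```python
def clock_time(hours_after_midnight):
    """Format a time of day, rounded to the nearest minute, as HH:MM (24-hour clock)."""
    total_minutes = round(hours_after_midnight * 60)
    return f"{total_minutes // 60:02d}:{total_minutes % 60:02d}"

def measurement_times(points, day_length=24.0):
    """Clock times for the Gauss points u (positive members of each pair), noon as midpoint."""
    noon = day_length / 2
    offsets = []
    for u in points:
        offsets += [-u, u] if u else [0.0]
    return [clock_time(noon + day_length * u) for u in sorted(offsets)]

GAUSS_4 = [0.1699905218, 0.4305681558]     # u's for n = 4 from the table of Art. 54
print(measurement_times(GAUSS_4))
```

Passing the u's for n = 5, 6, etc. (with 0.0 for the midpoint when n is odd) gives the corresponding times for other numbers of measurements.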





In a similar manner we could find the best times of the day for making five, six, or any other number of measurements by taking the proper u's for $n = 5$, 6, etc.

The same method can be applied for finding the best positions or times 
for taking measurements on any other physical quantity. 

Remarks. 1. The reader should bear in mind that Gauss's formula gives an exact result when $f(x)$ is a polynomial of the $(2n - 1)$th degree or lower.

2. Although Gauss's method is theoretically beautiful and of great accuracy, it has the disadvantage of being laborious in its application, for three reasons:

(a) If the limits of the given integral are not $-\tfrac12$ and $\tfrac12$, the integral must be transformed to one that has these limits. The transformed integral is usually more complicated than the given one.

(b) If the values of y are to be computed from a formula, the numerical values of u to be substituted in the formula must be given to at least as many significant figures as we wish to obtain in the y's.

(c) After we have found the y's to the desired number of significant figures we must multiply them by R's having at least as many figures.

Gauss’s formula thus compels us to deal with large numbers in every 
step if we desire the accuracy it is capable of giving. In applying this 
formula it is therefore imperative that we use every available aid for 
reducing the labor of computation. Whoever doubts this statement has 
only to work out a simple example to be convinced. 

3. Gauss’s formula should be used for computing definite integrals only 
when few ordinates are obtainable or when the importance of the result is 
such as to justify a great expenditure of labor. 

55. Lobatto's Formula. The reader will have noticed that Gauss's formula does not contain the values of the function at the end points of the interval of integration. In certain types of problems it is highly advantageous to utilize the end values of the function. Lobatto * therefore modified Gauss's formula so as to include the end values and also the value of the function at the midpoint of the interval. The modification consisted in finding the points of subdivision (including the midpoint and the end points of the interval) from the equation

(1) $$\frac{d^{\,n-2}}{du^{\,n-2}}\left[\left(u^2 - \tfrac14\right)^{n-1}\right] = 0,$$

where n denotes the total number of values of the function to be utilized (including end values and midpoint values). The values of u found from this equation are then substituted in equations (54.3) to find the corresponding R's. We shall derive Lobatto's formula for the case $n = 5$.

* Lessen over de Integraal-Rekening, §§ 207-210. The Hague, 1852.

When $n = 5$, (1) becomes

$$\frac{d^3}{du^3}\left[\left(u^2 - (\tfrac12)^2\right)^4\right] = 0,$$

or

$$\frac{d^3}{du^3}\left[u^8 - 4u^6(\tfrac12)^2 + 6u^4(\tfrac12)^4 - 4u^2(\tfrac12)^6 + (\tfrac12)^8\right] = 0.$$

On performing the indicated differentiations and simplifying, we get

$$u(112u^4 - 40u^2 + 3) = 0,$$

from which

$$u = 0, \ \pm\tfrac12\sqrt{3/7}, \ \pm\tfrac12.$$

Hence

$$u_{-2} = -\tfrac12, \quad u_{-1} = -\tfrac12\sqrt{3/7}, \quad u_0 = 0, \quad u_1 = \tfrac12\sqrt{3/7}, \quad u_2 = \tfrac12.$$

On substituting these values of u in the first five of equations (54.3) and solving for the R's, we find

$$R_{\pm2} = 1/20, \qquad R_{\pm1} = 49/180, \qquad R_0 = 16/45.$$

Hence Lobatto's formula for $n = 5$ is

$$I = (b - a)\left[\frac{16}{45}\,y_0 + \frac{49}{180}\,(y_{-1} + y_1) + \frac1{20}\,(y_{-2} + y_2)\right].$$
The values of u and R for several values of n are as follows:

             u                                 R
 n = 3.    u_0    = 0                        R = 2/3
           u_{±1} = 0.5                      R = 1/6
 n = 5.    u_0    = 0                        R = 16/45 = 0.355556
           u_{±1} = ½√(3/7) = 0.327327       R = 49/180 = 0.272222
           u_{±2} = 0.5                      R = 1/20 = 0.05
 n = 7.    u_0    = 0                        R = 0.2438097
           u_{±1} = 0.2344245                R = 0.2158727
           u_{±2} = 0.415112                 R = 0.1384129
           u_{±3} = 0.5                      R = 0.02380961
 n = 9.    u_0    = 0                        R = 0.1857593
           u_{±1} = 0.1815585                R = 0.1732142
           u_{±2} = 0.338593                 R = 0.1372695
           u_{±3} = 0.449879                 R = 0.08274774
           u_{±4} = 0.5                      R = 0.0138889
 n = 11.   u_0    = 0                        R = 0.1501037
           u_{±1} = 0.147876                 R = 0.1434397
           u_{±2} = 0.282617                 R = 0.1240270
           u_{±3} = 0.392242                 R = 0.09358390
           u_{±4} = 0.467000                 R = 0.05480614
           u_{±5} = 0.5                      R = 0.009091366


These values are to be substituted in formula (54.2). 

Lobatto's formula is less accurate than Gauss's for the same n, but it is frequently more convenient for use. If the function happens to be zero at the ends of the interval of integration, as is frequently the case, the end values do not have to be computed. The computation is thereby shortened.
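A brief sketch of the five-point Lobatto formula follows. It is illustrative only: the names are hypothetical, and the test integrand sin x on the interval 0 to π is an assumed example, chosen because it vanishes at both ends (apart from rounding), so that the end ordinates contribute nothing.

```python
import math

# (u, R) for n = 5 on the interval u = -1/2 to u = 1/2; each nonzero u stands for +u and -u.
LOBATTO_5 = [
    (0.0, 16 / 45),
    (0.5 * math.sqrt(3 / 7), 49 / 180),
    (0.5, 1 / 20),                      # the end points of the interval
]

def lobatto5(f, a, b):
    """Five-point Lobatto formula for the integral of f from a to b."""
    mid = (a + b) / 2
    length = b - a
    total = 0.0
    for u, r in LOBATTO_5:
        if u == 0.0:
            total += r * f(mid)
        else:
            total += r * (f(mid + length * u) + f(mid - length * u))
    return length * total

approx = lobatto5(math.sin, 0.0, math.pi)   # exact value is 2
print(f"Lobatto, n = 5: {approx:.6f}")
```

With five ordinates the formula is exact for polynomials up to the seventh degree, two degrees lower than Gauss's formula with the same number of ordinates.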

Example. Compute by Lobatto's formula the value of the integral

$$I = \int_5^{12} \frac{dx}{x}$$

for $n = 5$.

Solution. The integral must first be transformed so that the limits become $-\tfrac12$ and $\tfrac12$, just as was done when using the Gauss formula. From the previous example we have

$$x = 7u + 8.5, \qquad y = \frac1x = \frac1{7u + 8.5} = \phi(u).$$

Hence

$$y_0 = \phi(u_0) = \frac1{8.5} = 0.117647,$$

$$y_1 = \phi(u_1) = \frac1{7(0.327327) + 8.5} = \frac1{10.79129} = 0.0926673,$$

$$y_{-1} = \phi(u_{-1}) = \frac1{7(-0.327327) + 8.5} = \frac1{6.20871} = 0.1610641,$$

$$y_2 = \phi(u_2) = \frac1{7(0.5) + 8.5} = \frac1{12},$$

$$y_{-2} = \phi(u_{-2}) = \frac1{7(-0.5) + 8.5} = \frac15.$$

Hence 

The error in this value is 0.00015. 

56. Tchebycheff's Formula. Tchebycheff * devised a quadrature formula in which the coefficients of the y's are all equal. His formula is

(1) $$\int_{-1/2}^{1/2} \phi(u)\,du = \frac1n\,[\phi(u_1) + \phi(u_2) + \phi(u_3) + \cdots + \phi(u_n)],$$

and therefore

(2) $$\int_a^b f(x)\,dx = \frac{b - a}{n}\,[\phi(u_1) + \phi(u_2) + \phi(u_3) + \cdots + \phi(u_n)].$$

The points of subdivision of the interval of integration are symmetrically placed with respect to the midpoint of the interval. The u's are the zeros of certain polynomials, the first few of which, equated to zero, are:

$$(2u)^2 - 1/3 = 0$$

$$(2u)^3 - \tfrac12(2u) = 0$$

$$(2u)^4 - (2/3)(2u)^2 + 1/45 = 0$$

$$(2u)^5 - (5/6)(2u)^3 + (7/72)(2u) = 0$$

$$(2u)^6 - (2u)^4 + (1/5)(2u)^2 - 1/105 = 0.$$

The values of the u's for $n = 2$ to $n = 7$ and $n = 9$ are:

 n = 2.   u_{±1} = 0.288675
 n = 3.   u_0 = 0,  u_{±1} = 0.353553
 n = 4.   u_{±1} = 0.0937962,  u_{±2} = 0.397327
 n = 5.   u_0 = 0,  u_{±1} = 0.187271,  u_{±2} = 0.416249
 n = 6.   u_{±1} = 0.133318,  u_{±2} = 0.211259,  u_{±3} = 0.433123
 n = 7.   u_0 = 0,  u_{±1} = 0.161956,  u_{±2} = 0.264828,  u_{±3} = 0.441931
 n = 9.   u_0 = 0,  u_{±1} = 0.0839531,  u_{±2} = 0.264381,  u_{±3} = 0.300509,  u_{±4} = 0.455795.

* P. Tchebichef, "Sur les Quadratures," Journal de Mathématiques, 1874, p. 19. Some of the numerical values on p. 25 of that paper are incorrect.





Tchebycheff's formula can be derived directly in terms of the y's at the points of subdivision, by means of the polynomial function

(3) $$y = a_0 + a_1x + a_2x^2 + a_3x^3 + a_4x^4 + a_5x^5 + a_6x^6 + a_7x^7 + a_8x^8.$$

In deriving the formula we require that two conditions be met: (1) the coefficients of all y's are to be the same, and (2) the points of subdivision are to be symmetrically situated with respect to the midpoint of the interval of integration.

Take the midpoint of the interval as origin, let $x_1, -x_1, x_2, -x_2$, etc. denote the distances from the origin to the points of subdivision, and let $y_1, y_{-1}, y_2, y_{-2}$, etc. denote the corresponding functional values. Then if A denotes the area under the graph of (3) in the interval of integration, it is evident that for the case of nine subdivision points the required conditions are satisfied by the equation

(4) $$A = k\,(y_0 + y_1 + y_{-1} + y_2 + y_{-2} + y_3 + y_{-3} + y_4 + y_{-4}),$$

where k denotes the common coefficient of the y's.

Let l denote the length of the interval of integration $b - a$. Then the area under the graph of (3) is

(5) $$A = \int_{-l/2}^{l/2} y\,dx = \int_{-l/2}^{l/2} (a_0 + a_1x + a_2x^2 + a_3x^3 + a_4x^4 + a_5x^5 + a_6x^6 + a_7x^7 + a_8x^8)\,dx$$

$$= a_0l + \frac{a_2l^3}{12} + \frac{a_4l^5}{80} + \frac{a_6l^7}{448} + \frac{a_8l^9}{2304}.$$

To find $y_0, y_1, y_{-1}$, etc., we put $x = 0, x_1, -x_1$, etc. in (3) and get

$$y_0 = a_0,$$

$$y_1 + y_{-1} = 2\,(a_0 + a_2x_1^2 + a_4x_1^4 + a_6x_1^6 + a_8x_1^8),$$

$$y_2 + y_{-2} = 2\,(a_0 + a_2x_2^2 + a_4x_2^4 + a_6x_2^6 + a_8x_2^8),$$

etc.

Substituting into (4) these values of the y's, we obtain

(6) $$A = k\,[\,9a_0 + 2a_2(x_1^2 + x_2^2 + x_3^2 + x_4^2) + 2a_4(x_1^4 + x_2^4 + x_3^4 + x_4^4)$$

$$\qquad + 2a_6(x_1^6 + x_2^6 + x_3^6 + x_4^6) + 2a_8(x_1^8 + x_2^8 + x_3^8 + x_4^8)\,].$$





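Now equating the coefficients of the a's in (5) and (6), we find

$$k = \frac{l}{9},$$
$$x_1^2 + x_2^2 + x_3^2 + x_4^2 = \frac{3l^2}{8},$$
$$x_1^4 + x_2^4 + x_3^4 + x_4^4 = \frac{9l^4}{160}, \tag{7}$$
$$x_1^6 + x_2^6 + x_3^6 + x_4^6 = \frac{9l^6}{896},$$
$$x_1^8 + x_2^8 + x_3^8 + x_4^8 = \frac{l^8}{512}.$$

The solution of the system (7) is not easy, but the reader can easily
verify that the four equations are satisfied by the values

$$x_1 = 0.0839531\,l, \quad x_2 = 0.264381\,l, \quad x_3 = 0.300509\,l, \quad x_4 = 0.455795\,l.$$

Hence the u's for n = 9 above are $u_1 = x_1/l$, etc.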
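
The verification left to the reader can also be done numerically. The
following Python sketch is an illustration added to the text, not part of
the original derivation; the names `nodes`, `targets`, and `power_sum` are
hypothetical. It takes l = 1 and assumes the right-hand sides of (7) are
those obtained by equating (5) and (6) with k = l/9.

```python
# Abscissas quoted in the text, in units of the interval length l (l = 1 here).
nodes = [0.0839531, 0.264381, 0.300509, 0.455795]

# Assumed right-hand sides of system (7) for l = 1, from equating (5) and (6):
# 2k * sum(x_i^p) = coefficient of a_p in (5), with k = 1/9.
targets = {2: 3 / 8, 4: 9 / 160, 6: 9 / 896, 8: 1 / 512}


def power_sum(xs, p):
    """Return the sum of x**p over the given abscissas."""
    return sum(x ** p for x in xs)


# Residual of each equation; all should be small compared with the targets.
residuals = {p: power_sum(nodes, p) - rhs for p, rhs in targets.items()}

for p, r in residuals.items():
    print(f"sum of x^{p}: residual {r:+.2e}")
```

The residuals are limited only by the six or seven figures to which the
abscissas are quoted.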

Tchebycheff's formula in terms of the y's directly is thus, for n = 9:

$$A = \frac{l}{9}(y_0 + y_1 + y_{-1} + y_2 + y_{-2} + y_3 + y_{-3} + y_4 + y_{-4}), \tag{8}$$

where the y's must be measured at the subdivision points of the interval
of integration.


Because of the fact that the functional values all have equal weight (the
same coefficient) in Tchebycheff's formula, this formula is particularly
appropriate for use when the functional values are found by measurement;
for in that case the positive and negative errors of measurement will largely
cancel one another. Tchebycheff's formula would be ideal for finding areas
from drawings if it were not for the fact that the points of division are not
readily located. Even so, these points can probably be located with an
accuracy equal to that of the drawing.

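Example. Compute by Tchebycheff's formula the value of the integral

$$I = \int_5^{12} \frac{dx}{x}$$

for n = 5.

Solution. Here we must express x in terms of u as in the two preceding
examples. We have

$$x = 7u + 8.5, \qquad y = \frac{1}{x} = \frac{1}{7u + 8.5} = \phi(u).$$

Then

$$\phi(u_0) = \frac{1}{8.5} = 0.117647,$$
$$\phi(u_1) = \frac{1}{7(0.187271) + 8.5} = \frac{1}{9.810897} = 0.101927,$$
$$\phi(u_{-1}) = \frac{1}{7(-0.187271) + 8.5} = \frac{1}{7.189103} = 0.139099,$$
$$\phi(u_2) = \frac{1}{7(0.416249) + 8.5} = \frac{1}{11.413743} = 0.087614,$$
$$\phi(u_{-2}) = \frac{1}{7(-0.416249) + 8.5} = \frac{1}{5.586257} = 0.179011.$$

Hence

$$I = \frac{7}{5}(0.117647 + 0.101927 + 0.139099 + 0.087614 + 0.179011) = 0.875417.$$

The error in this result is 0.000051.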
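
The arithmetic of this example is easily mechanized. The Python sketch
below is an illustration added to the text; the function name
`tchebycheff` and the list `U5` are hypothetical. It assumes the integral
is that of 1/x from 5 to 12, as implied by the substitution x = 7u + 8.5,
and uses the five abscissas that appear in the solution.

```python
import math

# Tchebycheff abscissas for n = 5, referred to an interval of unit length
# centred at the origin (-1/2 <= u <= 1/2), as used in the worked example.
U5 = [0.0, 0.187271, -0.187271, 0.416249, -0.416249]


def tchebycheff(f, a, b, nodes=U5):
    """Equal-weight quadrature: (b - a)/n times the sum of f at the mapped nodes."""
    length = b - a
    mid = (a + b) / 2
    return length / len(nodes) * sum(f(length * u + mid) for u in nodes)


approx = tchebycheff(lambda x: 1 / x, 5, 12)
exact = math.log(12 / 5)
print(f"approximate value {approx:.6f}, error {exact - approx:.6f}")
```

Because every ordinate carries the same weight, the only data the routine
needs besides the integrand are the abscissas themselves.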

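57. Euler's Formula of Summation and Quadrature. The approximate
relation between integrals and sums is expressed by Euler's summation
formula. Written as a quadrature formula it is *

$$\int_a^b f(x)\,dx = h\left[\frac{f(x_0)}{2} + f(x_1) + f(x_2) + \cdots + f(x_{n-1}) + \frac{f(x_n)}{2}\right]$$
$$\qquad - \frac{h^2}{12}[f'(b) - f'(a)] + \frac{h^4}{720}[f'''(b) - f'''(a)] \tag{1}$$
$$\qquad - \frac{h^6}{30240}[f^{v}(b) - f^{v}(a)] + \frac{h^8}{1209600}[f^{vii}(b) - f^{vii}(a)] - \cdots.$$

It will be noted that the top line of (1) constitutes the Trapezoidal Rule.

By adding and subtracting $h[f(x_0)/2 + f(x_n)/2]$ on the right-hand side
of (1) we have

$$\int_a^b f(x)\,dx = h[f(x_0) + f(x_1) + \cdots + f(x_n)] - \frac{h}{2}[f(x_0) + f(x_n)] - \frac{h^2}{12}[f'(b) - f'(a)] + \cdots.$$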

* For the derivation of Euler's formula see Vallée-Poussin's Cours d'Analyse
Infinitésimale, II, p. 341; Whittaker and Robinson's Calculus of Observations,
p. 134; or Charlier's Mechanik des Himmels, II, Sl«





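Transposing and dividing through by h, we get

$$f(x_0) + f(x_1) + \cdots + f(x_n) = \frac{1}{h}\int_a^b f(x)\,dx + \frac{1}{2}[f(x_0) + f(x_n)]
 + \frac{h}{12}[f'(b) - f'(a)] - \frac{h^3}{720}[f'''(b) - f'''(a)] + \cdots,$$

or, since $x_0 = a$, $x_n = b$,

$$\sum_{x=a}^{b} f(x) = \frac{1}{h}\int_a^b f(x)\,dx + \frac{1}{2}[f(a) + f(b)] + \frac{h}{12}[f'(b) - f'(a)]
 - \frac{h^3}{720}[f'''(b) - f'''(a)] + \frac{h^5}{30240}[f^{v}(b) - f^{v}(a)] - \cdots. \tag{2}$$

Formula (2) is Euler's summation formula. It is useful for finding
the approximate sum of any number of consecutive values of a function
when these values are given for equidistant values of x, provided the
integral $\int_a^b f(x)\,dx$ can be easily evaluated. In these formulas h is the
distance between the equidistant values of x, so that $nh = b - a$.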

Note. Formulas (1) and (2) differ in an important respect from the
quadrature formulas previously derived. In (1) the terms on the right-hand
side, beginning with $(h^2/12)[f'(b) - f'(a)]$, form an asymptotic series.
The same is true of (2), beginning with the term $(h/12)[f'(b) - f'(a)]$.

An asymptotic series is an infinite series which converges for a certain
number of terms and then begins to diverge. In computing with such a
series it is important to know what term to stop with in order to get the
most accurate result. We should stop not with the smallest term but with
the term just before the smallest; for the error committed is usually less
than twice the first neglected term * and is therefore least when the first
term neglected is the smallest term in the series. For the reason just given
it is important that Euler's formula be used with caution, especially when
finding sums by (2). We shall now apply each of these formulas to an
example.

Example 1. Compute the value of $\pi$ from the formula

$$\frac{\pi}{4} = \int_0^1 \frac{dx}{1 + x^2}.$$

* See Charlier, loc. cit., p. 14.





Solution. We take $h = 1/6$ and compute the values of $y = 1/(1 + x^2)$
at each point of subdivision, as shown in the table below.

       x         y                x         y
       0     1                   2/3    0.69230769
      1/6    0.97297297          5/6    0.59016393
      1/3    0.9                  1     0.5
      1/2    0.8


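We next compute the derivatives of $1/(1 + x^2)$, as given below.

$$f(x) = \frac{1}{1 + x^2},$$
$$f'(x) = -\frac{2x}{(1 + x^2)^2},$$
$$f'''(x) = \frac{24x(1 - x^2)}{(1 + x^2)^4},$$
$$f^{v}(x) = -\frac{240x(3 - 10x^2 + 3x^4)}{(1 + x^2)^6},$$
$$f^{vii}(x) = \frac{5760x}{(1 + x^2)^8}\,[7 - 49x^2 + 49x^4 - 7x^6].$$

Hence

$$f'(0) = 0, \qquad f'(1) = -\tfrac{1}{2},$$
$$f'''(0) = 0, \qquad f'''(1) = 0,$$
$$f^{v}(0) = 0, \qquad f^{v}(1) = 15,$$
$$f^{vii}(0) = 0, \qquad f^{vii}(1) = 0.$$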


Substituting all these values in (1), we get

$$\frac{\pi}{4} = \frac{1}{6}[0.75 + 0.97297297 + 0.9 + 0.8 + 0.69230769 + 0.59016393]
 - \frac{1}{36 \times 12}\left(-\frac{1}{2}\right) - \frac{1}{6^6 \times 30240}(15) = 0.78539816,$$

which is correct to its last figure.
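
The same computation can be repeated by machine. The Python sketch below
is an illustration added to the text and not part of the original
solution; the variable names are hypothetical. It forms the trapezoidal
sum for h = 1/6 and then applies the correction terms of (1), taking the
end-point derivatives from the values listed above.

```python
import math


def f(x):
    """Integrand 1/(1 + x^2)."""
    return 1 / (1 + x * x)


n = 6
h = 1 / n
xs = [i * h for i in range(n + 1)]

# Trapezoidal sum: the top line of formula (1).
trapezoid = h * (f(xs[0]) / 2 + sum(f(x) for x in xs[1:-1]) + f(xs[-1]) / 2)

# Correction terms of (1), using the derivative values from the text:
# f'(0) = 0, f'(1) = -1/2, f'''(0) = f'''(1) = 0, f^v(0) = 0, f^v(1) = 15.
correction = (
    -(h ** 2 / 12) * (-0.5 - 0.0)
    + (h ** 4 / 720) * (0.0 - 0.0)
    - (h ** 6 / 30240) * (15.0 - 0.0)
)

quarter_pi = trapezoid + correction
print(f"{quarter_pi:.8f} against {math.pi / 4:.8f}")
```

The trapezoidal sum alone is correct to only about three figures; nearly
all of the remaining accuracy comes from the single term in f'.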

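Example 2. Find the sum of

$$\frac{1}{51^2} + \frac{1}{53^2} + \frac{1}{55^2} + \cdots + \frac{1}{99^2}.$$

Solution. Here $f(x) = 1/x^2$ and $h = 2$. Then

$$f'(x) = -\frac{2}{x^3}, \quad f'''(x) = -\frac{24}{x^5}, \quad f^{v}(x) = -\frac{720}{x^7}, \quad f^{vii}(x) = -\frac{40320}{x^9}.$$

Remembering that $a = 51$, $b = 99$, and substituting in (2), we get

$$\sum_{x=51}^{99} \frac{1}{x^2} = \frac{1}{2}\left[\frac{1}{51} - \frac{1}{99}\right] + \frac{1}{2}\left[\frac{1}{51^2} + \frac{1}{99^2}\right]
 + \frac{1}{3}\left[\frac{1}{51^3} - \frac{1}{99^3}\right] - \frac{4}{15}\left[\frac{1}{51^5} - \frac{1}{99^5}\right]$$
$$\qquad + \frac{16}{21}\left[\frac{1}{51^7} - \frac{1}{99^7}\right] - \frac{64}{15}\left[\frac{1}{51^9} - \frac{1}{99^9}\right] + \cdots$$
$$\qquad = 0.004753416 + 0.0002432490 + 0.0000021694 - 0.0000000008$$
$$\qquad = 0.004998833.$$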


If we had attempted to find the sum of the squares of the reciprocals
of all the odd numbers from 1 to 99 we could not have obtained it accurately,
for each bracketed quantity after the second would have been practically
unity and therefore the various terms would have been the same as the
coefficients 4/15, 16/21, 64/15, etc. To get the greatest accuracy in this
case we should have to stop with the third term and even then the error
might be nearly 8/15. Hence the necessity for caution in finding sums
by means of Euler's formula.
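
The sum in Example 2 is short enough to be added directly, which gives a
check on formula (2). The Python sketch below is an illustration added to
the text; the function name `euler_sum_inverse_squares` is hypothetical.
It keeps the same four terms as the worked solution and compares the
result with the directly computed sum.

```python
def euler_sum_inverse_squares(a, b, h=2):
    """Approximate the sum of 1/x^2 for x = a, a + h, ..., b by formula (2)."""
    integral = (1 / a - 1 / b) / h                        # (1/h) * integral of 1/x^2
    ends = 0.5 * (1 / a ** 2 + 1 / b ** 2)                # (1/2)[f(a) + f(b)]
    d1 = (h / 12) * (-2 / b ** 3 + 2 / a ** 3)            # (h/12)[f'(b) - f'(a)]
    d3 = -(h ** 3 / 720) * (-24 / b ** 5 + 24 / a ** 5)   # -(h^3/720)[f'''(b) - f'''(a)]
    return integral + ends + d1 + d3


approx = euler_sum_inverse_squares(51, 99)
direct = sum(1 / x ** 2 for x in range(51, 100, 2))
print(f"formula (2): {approx:.12f}")
print(f"direct sum : {direct:.12f}")
```

With a = 1 in place of a = 51 the later terms no longer decrease, which is
the difficulty described in the paragraph above.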


58. Caution in the Use of Quadrature Formulas. The student should 
ever bear in mind that when computing the value of a definite integral by 
means of a quadrature formula he is really replacing the given integrand 
by a polynomial and integrating this polynomial over the given interval of 
integration. The accuracy of the result will depend upon how well the 
polynomial represents the integrand over this interval; or, geometrically, 
on how well the graph of the polynomial coincides with the graph of the 
integrand. Before beginning the computation of an integral by a quadra- 
ture formula the computer should ascertain the nature and behavior of the 
integrand over the interval of integration. In some instances it may be 
necessary to construct an accurate graph of the integrand. The computa- 
tion can then be planned with reference to the nature and behavior of the 
function to be integrated. The following example will illustrate this point. 


Example 1. Find by Simpson's Rule the value of the integral

$$I = \int_{-1}^{1} \frac{x^7 \sqrt{1 - x^2}}{(2 - x)^{13/2}}\,dx.$$

Solution. The integrand is evidently negative from x = -1 to x = 0,
and positive from x = 0 to x = 1. Hence we divide each of these intervals
into four equal parts and compute the value of the integrand at each point
of subdivision. The results are given in the table below.

        x           y                  x          y
      -1        0                     0.25    0.000001555
      -0.75    -0.0001231             0.50    0.000485
      -0.50    -0.00001753            0.75    0.02070
      -0.25    -0.000000304           1       0
       0        0

On applying Simpson's Rule to these tabular values we find

$$I_{-1}^{0} = -0.0000441,$$
$$I_{0}^{1} = 0.006981,$$
$$I = -0.0000441 + 0.006981 = 0.006937.$$

This result could be accepted with confidence if the tabular values were
of the same order of magnitude, but the table shows that the integrand
at x = 0.50 is enormously larger than it is for smaller values of x, and
that at x = 0.75 it is enormously larger than at x = 0.50. Hence we had
better examine this function more closely in the region from x = 0.50 to
x = 1 and possibly make a new computation of the integral.


        x         y               x         y
      0.50    0.000485          0.80    0.038468
      0.55    0.001136          0.82    0.048654
      0.60    0.002514          0.84    0.061016
      0.65    0.005297          0.86    0.075765
      0.70    0.010688          0.88    0.092918
      0.75    0.020701          0.90    0.11221
      0.80    0.038468          0.92    0.13259
                                0.94    0.15149
                                0.96    0.16306
                                0.98    0.15190
                                1       0


The above table shows the variation of the integrand in the interval
$0.50 \le x \le 1$, and Fig. 4 shows the graph for the whole interval from
x = -1 to x = 1. A glance at the graph shows that in order to obtain
a trustworthy result we should divide the computation into three distinct
parts:

(1) By taking h = 0.25 in the interval $-1 \le x \le 0.5$,

(2) By taking h = 0.05 in the interval $0.5 \le x \le 0.8$,

(3) By taking h = 0.02 in the interval $0.8 \le x \le 1$.

[Fig. 4. Graph of the integrand for $-1 \le x \le 1$.]

The results of these computations are

$$I_{-1}^{0.5} = -0.0000031,$$
$$I_{0.5}^{0.8} = 0.002898,$$
$$I_{0.8}^{1} = 0.020651,$$
$$I = -0.0000031 + 0.002898 + 0.020651 = 0.0235.$$
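
The effect of the unequal subdivision can be seen by repeating both
computations by machine. The Python sketch below is an illustration added
to the text; the names `integrand` and `simpson` are hypothetical, and the
integrand is assumed to be $x^7\sqrt{1 - x^2}/(2 - x)^{13/2}$, a reading
which reproduces the tabulated ordinates.

```python
import math


def integrand(x):
    """Assumed integrand of Example 1: x^7 * sqrt(1 - x^2) / (2 - x)^(13/2)."""
    return x ** 7 * math.sqrt(max(0.0, 1 - x * x)) / (2 - x) ** 6.5


def simpson(f, a, b, n):
    """Composite Simpson's Rule with n (even) equal subintervals."""
    if n % 2:
        raise ValueError("n must be even")
    h = (b - a) / n
    total = f(a) + f(b)
    total += 4 * sum(f(a + i * h) for i in range(1, n, 2))
    total += 2 * sum(f(a + i * h) for i in range(2, n, 2))
    return h * total / 3


# First attempt: h = 0.25 throughout.
coarse = simpson(integrand, -1, 0, 4) + simpson(integrand, 0, 1, 4)

# Second attempt: the three parts listed in the text.
pieces = [
    simpson(integrand, -1, 0.5, 6),   # h = 0.25
    simpson(integrand, 0.5, 0.8, 6),  # h = 0.05
    simpson(integrand, 0.8, 1, 10),   # h = 0.02
]
refined = sum(pieces)

print(f"uniform h = 0.25: {coarse:.6f}")
print(f"three parts     : {refined:.4f}")
```

The two results differ by a factor of more than three, although each is a
correct application of Simpson's Rule to the ordinates supplied.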

Even when the graph of the integrand is a smooth, regular curve in 
the interval of integration, a quadrature formula may not give a very 
accurate result unless the subdivisions are very small. This fact is illus- 
trated by the following example. 

Example 2. Find by Simpson's Rule the value of

$$I = \int_{-1}^{1} \sqrt{(1 - x^2)(2 - x)}\,dx.$$

Solution. The values of the integrand are given in the table below.

        x         y               x         y
      -1      0                  0.1    1.371496
      -0.9    0.742294           0.2    1.314534
      -0.8    1.003992           0.3    1.243756
      -0.7    1.173456           0.4    1.159310
      -0.6    1.289961           0.5    1.060660
      -0.5    1.369307           0.6    0.946573
      -0.4    1.419859           0.7    0.814248
      -0.3    1.446720           0.8    0.657267
      -0.2    1.453272           0.9    0.457165
      -0.1    1.441874           1      0
       0      1.414214




The correct value of the given integral to five significant figures is found
from a table of elliptic integrals to be

$$I = 2.2033.$$

Simpson's Rule gives the following values for different values of h:

(a) I = 2.0914 for h = 0.5. Percentage error = 5.1%.

(b) I = 2.1751 for h = 0.2. Percentage error = 1.28%.

(c) I = 2.1934 for h = 0.1. Percentage error = 0.45%.


It will be observed that when the interval of integration was divided 
into 20 subintervals the error was nearly a half of one per cent, which is 
less than slide-rule accuracy. Inasmuch as the tabular values are all 
correct to six or seven figures, the errors in the results found above are 
due entirely to the inherent inaccuracy of Simpson’s Rule. The trouble 
with this problem lies in the fact that the integrand cannot be approxi- 
mated closely by a polynomial near the end points of the range of integra- 
tion, for at these points the slope of the integrand is infinite. A better 
approximation can be obtained by using horizontal parabolas for the regions 
near the ends of the interval (see Note on p. 143). Thus, for the region 
at the left end of the interval, we have 

$$I_1 = (2/3)(0.2 \times 1.003992) = 0.1338656;$$

and for the region at the right end we have

$$I_3 = (2/3)(0.2 \times 0.657267) = 0.0876356.$$

Then the application of Simpson's Rule to the region from x = -0.8 to
x = 0.8 gives

$$I_2 = 1.9780924.$$

The sum of these is

$$I_1 + I_2 + I_3 = I = 2.1996,$$

the percentage error of which is 0.17 per cent.
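
Both the plain Simpson values and the corrected value can be reproduced
as follows. The Python sketch below is an illustration added to the text;
the names `g` and `simpson` are hypothetical. The end regions are treated
as horizontal parabolas, each of area two thirds of its width times its
bounding ordinate, exactly as in the computation above.

```python
import math


def g(x):
    """Integrand of Example 2: sqrt((1 - x^2)(2 - x))."""
    return math.sqrt(max(0.0, (1 - x * x) * (2 - x)))


def simpson(f, a, b, n):
    """Composite Simpson's Rule with n (even) equal subintervals."""
    if n % 2:
        raise ValueError("n must be even")
    h = (b - a) / n
    total = f(a) + f(b)
    total += 4 * sum(f(a + i * h) for i in range(1, n, 2))
    total += 2 * sum(f(a + i * h) for i in range(2, n, 2))
    return h * total / 3


TABLE_VALUE = 2.2033  # value quoted in the text from a table of elliptic integrals

plain = {n: simpson(g, -1, 1, n) for n in (4, 10, 20)}  # h = 0.5, 0.2, 0.1

# End regions as horizontal parabolas, middle region by Simpson's Rule (h = 0.1).
left = (2 / 3) * 0.2 * g(-0.8)
right = (2 / 3) * 0.2 * g(0.8)
middle = simpson(g, -0.8, 0.8, 16)
corrected = left + middle + right

for n, value in plain.items():
    print(f"n = {n:2d}: {value:.4f}  ({100 * (TABLE_VALUE - value) / TABLE_VALUE:.2f}% error)")
print(f"with end corrections: {corrected:.4f}")
```

Halving h from 0.2 to 0.1 reduces the error by a factor of less than
three, far short of the factor of sixteen that Simpson's Rule gives for a
well-behaved integrand.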

In Art. 62 several formulas will be derived for the inherent error in
Simpson's Rule, but occasionally a problem may arise when the approximate
error cannot be easily determined even with the aid of those formulas.

59. Mechanical Cubature. In this article we shall give two methods
for finding the numerical value of a definite double integral of a function 
of two independent variables. The first method will be by application of a 
formula which may be regarded as an extension of Simpson’s Rule to 
functions of two variables. The second method is simply by repeated 
application of the ordinary quadrature formulas for one variable. 

To derive the double quadrature formula we start with the formula for
double interpolation, namely (X) of Art. 46, and integrate this formula over
two intervals in the y-direction and two in the x-direction, first omitting
from the formula all terms involving the differences $\Delta^{3+0}$, $\Delta^{0+3}$,
$\Delta^{4+0}$, $\Delta^{3+1}$, $\Delta^{1+3}$, $\Delta^{0+4}$, since these differences involve
values of the function outside the rectangle over which we are integrating.

Since $dx = h\,du$, $dy = k\,dv$ we have, after omitting the terms just
mentioned,

$$I = \int_{x_0}^{x_0+2h}\!\int_{y_0}^{y_0+2k} z\,dy\,dx = hk\int_0^2\!\int_0^2 \Big\{ z_{00} + [u\,\Delta^{1+0}z_{00} + v\,\Delta^{0+1}z_{00}]$$
$$\qquad + \frac{1}{2!}[u(u-1)\Delta^{2+0}z_{00} + 2uv\,\Delta^{1+1}z_{00} + v(v-1)\Delta^{0+2}z_{00}]$$
$$\qquad + \frac{1}{3!}[3u(u-1)v\,\Delta^{2+1}z_{00} + 3uv(v-1)\Delta^{1+2}z_{00}]$$
$$\qquad + \frac{1}{4!}[6u(u-1)v(v-1)\Delta^{2+2}z_{00}] \Big\}\,dv\,du.$$

Performing the indicated integrations and replacing the double differences
by their values as given in Art. 45, we get

$$I = \frac{hk}{9}[z_{00} + z_{02} + z_{22} + z_{20} + 4(z_{01} + z_{12} + z_{21} + z_{10}) + 16z_{11}]. \tag{1}$$

This is the formula which corresponds to Simpson's Rule for a function
of one variable. It can be represented diagrammatically as shown in Fig. 5,
the coefficients of the several z's being shown on the diagram. By adding
any number of unit blocks of this type we could obtain a general formula
for double integration, corresponding to Simpson's Rule for n intervals in
single integration, but it is not worth while to do this.

Formula (1) can be rewritten in either of the following forms:

$$I = \frac{k}{3}\left[\frac{h}{3}(z_{00} + 4z_{01} + z_{02}) + 4\cdot\frac{h}{3}(z_{10} + 4z_{11} + z_{12}) + \frac{h}{3}(z_{20} + 4z_{21} + z_{22})\right], \tag{2}$$

$$I = \frac{h}{3}\left[\frac{k}{3}(z_{00} + 4z_{10} + z_{20}) + 4\cdot\frac{k}{3}(z_{01} + 4z_{11} + z_{21}) + \frac{k}{3}(z_{02} + 4z_{12} + z_{22})\right]. \tag{3}$$

Now, such an expression as $(k/3)(z_{00} + 4z_{10} + z_{20})$ is nothing but Simpson's
Rule applied to a single row in the diagram, in this case the top horizontal
row. Let us put

$$A_0 = \frac{k}{3}(z_{00} + 4z_{10} + z_{20}), \qquad A_1 = \frac{k}{3}(z_{01} + 4z_{11} + z_{21}), \quad \text{etc.}$$

Then (3) becomes

$$I = \frac{h}{3}(A_0 + 4A_1 + A_2). \tag{4}$$

This formula shows that formula (1) is equivalent to applying Simpson's
Rule to each horizontal row in the diagram and then applying it again
to the results thus obtained. These considerations lead to the following
general statement:





If we are given a rectangular array of values of a function of two 
variables, we may apply to each horizontal row or to each vertical column 
any quadrature formula employing equidistant ordinates, such as Simpson's 
and Weddle's formulas. Then to the results thus obtained for the rows 
(or columns) we may again apply a similar formula. 



This important result makes it unnecessary to derive general formulas 
for approximate double integration. 

It is instructive to notice the geometric significance of this general
statement. Since the double integral between constant limits of a function
of two variables is represented by the volume of a solid having a
rectangular base and a height at any point equal to $z\,[= f(x, y)]$, it is
evident that the integrals $A_0$, $A_1$, etc. are merely vertical cross-sectional
areas of this solid made by equidistant planes. Then when we apply a
quadrature formula to these A's we are merely finding the volume of the
solid, as if we evaluated the integral $\int A_x\,dx$.

An engineering application of mechanical cubature would be the solution 
of such a problem as the following: 




Suppose it were necessary to determine the amount of earth to be moved
in making an excavation for a large building on uneven ground, or in
grading down or filling in a city block. The area to be excavated would
be divided up into small rectangles by running two systems of equidistant
parallel lines at right angles to each other. The distances of the corners
of these rectangles above or below an assumed datum plane would be the z's
of this article. Knowing these z's and the distances between the parallel
lines (the h's and k's), we could find the volume of the excavation by the
methods given above.

We shall now work two examples by these methods. 

Example 1. Find by formula (1) the value of the integral

$$I = \int_{4}^{4.4}\!\int_{2}^{2.6} \frac{dy\,dx}{xy}.$$

Solution. Taking h = 0.2 and k = 0.3, we compute the values of z = 1/xy
shown in the table below.

      y \ x       4.0          4.2          4.4
       2.0     0.125000     0.119048     0.113636
       2.3     0.108696     0.103520     0.0988142
       2.6     0.096154     0.0915751    0.0874126


Substituting these in (1), we get

$$I = \frac{0.2 \times 0.3}{9}[0.125000 + 0.096154 + 0.0874126 + 0.113636$$
$$\qquad + 4(0.108696 + 0.0915751 + 0.0988142 + 0.119048) + 16 \times 0.103520]$$
$$\quad = 0.0250070.$$

The true value of the integral is

$$\int_{4}^{4.4}\!\int_{2}^{2.6} \frac{dy\,dx}{xy} = \ln 1.1 \times \ln 1.3 = 0.0953102 \times 0.262364 = 0.0250060.$$

The error is therefore

$$E = 0.0250060 - 0.0250070 = -0.0000010.$$
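
Formula (1) is a weighted sum of nine ordinates and is readily programmed.
The Python sketch below is an illustration added to the text; the function
name `cubature9` and the layout of the argument `z` as a 3 by 3 nested
list are hypothetical choices. It repeats the computation of Example 1 and
compares the result with the product of the two logarithms.

```python
import math


def cubature9(z, h, k):
    """Formula (1) for a 3 x 3 array z[i][j]; h and k are the two spacings."""
    corners = z[0][0] + z[0][2] + z[2][2] + z[2][0]
    edges = z[0][1] + z[1][2] + z[2][1] + z[1][0]
    return h * k / 9 * (corners + 4 * edges + 16 * z[1][1])


xs = [4.0, 4.2, 4.4]   # spacing h = 0.2
ys = [2.0, 2.3, 2.6]   # spacing k = 0.3
grid = [[1 / (x * y) for y in ys] for x in xs]

approx = cubature9(grid, 0.2, 0.3)
exact = math.log(1.1) * math.log(1.3)
print(f"formula (1): {approx:.7f}, true value: {exact:.7f}")
```

Because the weights are symmetric in the two indices, interchanging the
roles of rows and columns in `grid` leaves the result unchanged.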










Example 2. Find by numerical integration the value of the integral

$$I = \int_{4}^{5.2}\!\int_{2}^{3.2} \frac{dy\,dx}{xy}.$$

Solution. Here we take h = 0.2, k = 0.3 as before, and compute the
following table of values of z = 1/xy.

  y \ x     4.0        4.2        4.4        4.6        4.8        5.0        5.2
   2.0   0.125000   0.119048   0.113636   0.108696   0.104167   0.100000   0.096154
   2.3   0.108696   0.103520   0.0988142  0.0945180  0.0905797  0.0869565  0.0836120
   2.6   0.096154   0.0915751  0.0874126  0.0836120  0.0801282  0.0769231  0.0739645
   2.9   0.0862069  0.0821018  0.0783699  0.0749625  0.0718391  0.0689655  0.0663130
   3.2   0.078125   0.0744048  0.0710227  0.0679348  0.0651042  0.0625000  0.0600962


Applying Weddle's Rule to each horizontal row, we have

$$A_0 = 0.06[0.125000 + 5(0.119048) + 0.113636 + 6(0.108696) + 0.104167 + 5(0.100000) + 0.096154] = 0.131182,$$

$$A_1 = 0.114072, \quad A_2 = 0.100909, \quad A_3 = 0.090470, \quad A_4 = 0.081989.$$

Now applying Simpson's Rule to the A's, we get

$$I = 0.1[0.131182 + 4(0.114072) + 2(0.100909) + 4(0.090470) + 0.081989] = 0.123316.$$

The true value of this integral is

$$\int_{4}^{5.2}\!\int_{2}^{3.2} \frac{dy\,dx}{xy} = \ln 1.3 \times \ln 1.6 = 0.123312,$$

and the error is therefore

$$E = 0.123312 - 0.123316 = -0.000004.$$
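
The two-stage procedure of this example, Weddle's Rule along each row
followed by Simpson's Rule on the resulting A's, can be sketched in Python
as follows. The code is an illustration added to the text; the function
names `weddle` and `simpson` are hypothetical, and the ordinates are
computed afresh from z = 1/xy rather than copied from the table.

```python
import math


def weddle(y, h):
    """Weddle's Rule for seven equidistant ordinates spaced h apart."""
    weights = (1, 5, 1, 6, 1, 5, 1)
    return 3 * h / 10 * sum(w * v for w, v in zip(weights, y))


def simpson(y, h):
    """Composite Simpson's Rule for an odd number of equidistant ordinates."""
    return h / 3 * (y[0] + y[-1] + 4 * sum(y[1:-1:2]) + 2 * sum(y[2:-1:2]))


xs = [4.0 + 0.2 * i for i in range(7)]   # h = 0.2
ys = [2.0 + 0.3 * j for j in range(5)]   # k = 0.3

# One cross-sectional area A_j for each horizontal row (fixed y).
rows = [weddle([1 / (x * y) for x in xs], 0.2) for y in ys]

approx = simpson(rows, 0.3)
exact = math.log(1.3) * math.log(1.6)
print("A's:", [round(a, 6) for a in rows])
print(f"I = {approx:.6f}, true value {exact:.6f}")
```

Any other pair of equidistant-ordinate formulas could be substituted for
`weddle` and `simpson`, provided the number of ordinates in each direction
suits the formula chosen.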

60. Prismoids and the Prismoidal Formula. The formula to be considered
in this article is a special form of Simpson's Rule and the oldest
form of that rule. It is treated here because of its importance and wide
applicability in the mensuration of solids. The formula will be derived
with reference to a solid.

A prismoid may be defined as a solid whose bases are polygons in parallel 
planes and whose lateral faces are ruled surfaces, either plane or warped. 

If we let one base of the prismoid lie in the yz-plane and let the lateral
faces extend in the general direction of the positive x-axis, the volume
of the prismoid will be given by the integral

$$\int A(x)\,dx,$$

where A(x) denotes the area of a cross section parallel to the bases and
at a distance x from the yz-plane. To find A(x), let

$$y = ax + b, \tag{1}$$
$$z = cx + d \tag{2}$$

be the equations (in projection form) of any moving straight line. Such
a line can move in any manner and will generate a ruled surface as it
moves along. In a plane parallel to the yz-plane and distant x to the
right of it the area of the cross section of the prismoid is given by the
integral

$$A(x) = \int y\,dz,$$

the integration being extended over the entire cross section.

Now, in the fixed plane under consideration x is a constant, and y and
z are functions only of the parameters a, b, c, d; that is, in this plane
y and z can vary only when and as the parameters vary. Let each of these
parameters be a function of a single parameter $\alpha$. Then y and z become
functions of $\alpha$, and from (2) we have

$$dz = \frac{d}{d\alpha}(cx + d)\,d\alpha = (c'x + d')\,d\alpha.$$

The integral giving the area of the cross section now becomes

$$A(x) = \int y\,dz = \int_{\alpha_0}^{\alpha_1} (ax + b)(c'x + d')\,d\alpha
 = x^2\int_{\alpha_0}^{\alpha_1} ac'\,d\alpha + x\int_{\alpha_0}^{\alpha_1} (bc' + ad')\,d\alpha + \int_{\alpha_0}^{\alpha_1} bd'\,d\alpha,$$

where it is assumed that the entire cross section is covered when $\alpha$ varies
from $\alpha_0$ to $\alpha_1$. Since each side of the cross-sectional polygon is a section
of a different ruled surface, the integrands above will change (be replaced
by others) as the different sides of the polygon are encountered.* Also,
since the above integrals in the expression for A(x) are independent of x,
they may be denoted by p, q, r, respectively. Hence we have the important
result that

$$A(x) = px^2 + qx + r, \tag{3}$$

which shows that the area of the cross section of a prismoid is a quadratic
function of its distance from one base and therefore from either of its bases.

* The integral giving A(x) in this problem is really a line integral. See E. B.
Wilson's Advanced Calculus, p. 289, or Goursat-Hedrick, Mathematical Analysis,
Vol. I, pp. 187-189.

To find the volume of a prismoid of length $l$, we take coordinate axes
with the yz-plane coinciding with the midsection of the solid. Then

$$V = \int_{-l/2}^{l/2} A(x)\,dx = \int_{-l/2}^{l/2} (px^2 + qx + r)\,dx = \frac{pl^3}{12} + rl. \tag{4}$$

Let $B_1$ and $B_2$ denote the areas of the bases of the prismoid and let M
denote the area of the midsection. Then from (3),

$$M = A(0) = r,$$
$$B_1 = A\!\left(-\frac{l}{2}\right) = \frac{pl^2}{4} - \frac{ql}{2} + r,$$
$$B_2 = A\!\left(\frac{l}{2}\right) = \frac{pl^2}{4} + \frac{ql}{2} + r.$$

Hence

$$B_1 + B_2 = \frac{pl^2}{2} + 2r = \frac{pl^2}{2} + 2M,$$

from which

$$p = \frac{2}{l^2}(B_1 + B_2 - 2M).$$

Substituting in (4) the values of p and r just found, we get

$$V = \frac{l}{6}(B_1 + B_2 + 4M), \tag{5}$$

which is the Prismoidal Formula. The reader should keep in mind the
verbal statement of this formula, namely: The volume of a prismoid is
equal to one sixth the product of its length by the sum of its bases and
four times its midsection.

It is to be noted that the prismoidal formula gives the exact volume 
not only of the prismoid but also of any other solid in which the area of 
the cross section is a quadratic function of the distance of the cross section 
from one base. Such solids are cones, pyramids, spheres, spherical seg- 
ments, frustums of cones and pyramids, wedges, paraboloids of revolution, 
and other solids of revolution. It also gives with close approximation 
the volumes of barrels and casks. 

Note. Some writers use the terms prismoid and prismatoid interchangeably,
as if both terms referred to the same solid. Such is not the
case, however. A prismatoid is usually defined as a polyhedron (a solid
with plane faces) having for bases two polygons in parallel planes and
for lateral faces triangles or trapezoids with one side common with one
base and the opposite vertex or side common with the other base. The
volume of a prismatoid is found by decomposing the solid into pyramids
having their vertices at a point in the midsection. The volume of a
prismatoid is given by the prismoidal formula because the area of the cross
section of a pyramid made by a plane parallel to its base is proportional
to the square of the distance from the section to the vertex (or base) of
the pyramid; that is, the area of the section is a quadratic function of its
distance from the base.

The prismoidal formula is the fundamental formula for computing the
volume of earth in cuts and fills for railroads, highways, canals, etc. The
cross-sectional areas of cuts (or fills) are determined at convenient intervals
(100 feet apart or less) and then the volume of earth to be excavated
(or filled in) is computed by the prismoidal formula.

Example. A proposed railroad cut 100 feet long is represented approximately
to scale in Fig. 6. Compute the number of cubic yards of material
to be excavated.

[Fig. 6. The proposed railroad cut.]

Solution. The areas of the bases and midsection are found by subtracting
the areas of the two triangles from the area of a trapezoid in each case.
Hence

$$B_1 = 561 - \tfrac{1}{2}(8 \times 12) - \tfrac{1}{2}(14 \times 21) = 366 \text{ sq. ft.},$$
$$B_2 = 405 - \tfrac{1}{2}(6 \times 9) - \tfrac{1}{2}(12 \times 18) = 270 \text{ sq. ft.},$$
$$M = 48(10) - 150 = 330 \text{ sq. ft.},$$

and therefore

$$V = \frac{100}{6}(366 + 270 + 4 \times 330) = 32{,}600 \text{ cu. ft.} = 1207.4 \text{ cu. yd.}$$
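
The prismoidal formula involves only three areas and a length, so that a
single short routine serves for any cut or fill. The Python sketch below
is an illustration added to the text; the function name
`prismoidal_volume` is hypothetical. It repeats the railroad-cut
computation and, as a check on the statement that the formula is exact for
a sphere, applies it to a sphere whose "bases" are points.

```python
import math


def prismoidal_volume(b1, b2, mid, length):
    """Prismoidal Formula (5): V = (l/6)(B1 + B2 + 4M)."""
    return length / 6 * (b1 + b2 + 4 * mid)


# Railroad cut of the example: areas in sq. ft., length in ft.
cut_cu_ft = prismoidal_volume(366, 270, 330, 100)
cut_cu_yd = cut_cu_ft / 27  # 27 cu. ft. in one cu. yd.

# Sphere of radius r: both bases vanish and the midsection is a great circle.
r = 2.0
sphere = prismoidal_volume(0.0, 0.0, math.pi * r ** 2, 2 * r)

print(f"cut: {cut_cu_ft:.0f} cu. ft. = {cut_cu_yd:.1f} cu. yd.")
print(f"sphere: {sphere:.6f} against {4 / 3 * math.pi * r ** 3:.6f}")
```

For a long cut the routine would be applied to each 100-foot station in
turn and the partial volumes added.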

When the prismoidal formula is used to find the volume of a solid in 
which the cross-sectional area does not vary as the square or cube of the 
distance of the section from one base, the error committed is the same as 
that in Simpson’s Rule (see Art. 62). 


EXERCISES VIII 

1. In the table below are given corresponding values of a variable x
and an unknown function y. For what value of x is y a minimum?

        x        y
        3      -205
        4      -240
        5      -260
        6      -262
        7      -250
        8      -224

2. For what value of x is the following tabulated function a minimum?

        x        y
       0.2     0.9182
       0.3     0.8975
       0.4     0.8873
       0.5     0.8862
       0.6     0.8936
       0.7     0.9086







3. In the year 1918 the declination of the sun at Greenwich mean noon
on certain dates was as given below. Find when the declination was a
maximum.

       Date         Declination
      June 19      23° 25' 23".6
        "  20       "  26  19 .4
        "  21       "  26  50 .5
        "  22       "  26  56 .8
        "  23       "  26  38 .3
        "  24       "  25  55 .1
        "  25       "  24  47 .1

4. Compute the value of

$$I = \int_0^{\pi/2} \sqrt{1 - 0.162\sin^2\phi}\;d\phi$$

by Simpson's Rule and by Weddle's Rule, taking

$$\phi = 0°,\ 15°,\ 30°,\ 45°,\ 60°,\ 75°,\ 90°.$$

Compare your results with that found by the series method in Exercise
26, Chapter I. Also compare the amount of labor involved in each case.

1 dx 

6. Compute J by Gauss’s method, taking n — 6. 

6. Compute by Simpson’s Rule the value of the integral 

ds 

""Jio* logio®’ 

taking eight subintervals. 

7. Find by Weddle’s Rule the value of the integral 

■*•• xda; 


xdx 

““ J ,.4 sinh X ’ 


taking twelve subintervals.





8 . Find by Euler’s quadrature formula the value of the integral 

/ — ^ cos x^dx 

and compare the result with that found by integrating the series for cos x*. 

9. Find by Euler's summation formula the sum of

$$\frac{1}{400} + \frac{1}{402} + \cdots + \frac{1}{498} + \frac{1}{500}.$$


10. Find the value of the integral 


J V0O® 

logu, 

so* 


sin r dr 


by Simpson's Rule, taking ten subintervals.

11. Compute by any method the value of the integral 

t /3 

V COS) 0 do. 

0 


12. Compute to five decimal places the value of 

xdx 




0 cos X 

13. Using the data of the following table, compute the integrals 



(b) r y^dx, (c) f x^ydx, 

•/ 0 5 0 a 

J -i 1 

y^di, 

W S 


by Simpson’s Rule. 


      x     0.5      0.6      0.7      0.8      0.9      1.0      1.1
      y   0.4804   0.5669   0.6490   0.7262   0.7985   0.8658   0.9281



CHAPTER IX 


THE ACCURACY OF QUADRATURE FORMULAS 

61. Introduction. A computer should have some means of estimating 
the reliability of every computed result. It is not always possible to have 
an explicit formula giving the error committed, but usually there exists 
some means for ascertaining the magnitude of the majority of unavoidable 
errors. In the present chapter we consider the accuracy of the more im- 
portant quadrature formulas and give expressions for the inherent errors in 
several of them.* 

62. Formulas for the Inherent Error in Simpson's Rule. (a) The
General Formula. Let f(x) denote a function which is finite and continuous
in the interval $x = x_0 - h$ to $x = x_0 + h$ and has continuous
derivatives of all orders up to and including the fourth in that interval.
Furthermore, let F(x) denote the integral of f(x), so that

$$F(x) = \int_a^x f(x)\,dx, \qquad F'(x) = f(x), \qquad F''(x) = f'(x), \quad \text{etc.}$$

Then

$$I = \int_{x_0-h}^{x_0+h} f(x)\,dx = F(x_0 + h) - F(x_0 - h).$$

The value of this integral by Simpson's Rule is

$$I_S = \frac{h}{3}[f(x_0 - h) + 4f(x_0) + f(x_0 + h)].$$

The difference between these results is the inherent error in Simpson's Rule,
so that

$$E_S = I - I_S = F(x_0 + h) - F(x_0 - h) - \frac{h}{3}[f(x_0 - h) + 4f(x_0) + f(x_0 + h)]. \tag{1}$$

This formula is of no practical value as it stands, because it is indefinite
and cumbersome. It can be reduced to a simple and workable form in
either of two ways.




1. The expression for $E_S$ is clearly a function of h. Hence, following
the method of Vallée-Poussin, we denote it by $\phi(h)$ and write

$$\phi(h) = F(x_0 + h) - F(x_0 - h) - \frac{h}{3}[f(x_0 - h) + 4f(x_0) + f(x_0 + h)].$$

Differentiating both sides successively with respect to h, we have

$$\phi'(h) = f(x_0 + h) + f(x_0 - h) - \frac{h}{3}[f'(x_0 + h) - f'(x_0 - h)] - \frac{1}{3}[f(x_0 + h) + 4f(x_0) + f(x_0 - h)],$$

$$\phi''(h) = f'(x_0 + h) - f'(x_0 - h) - \frac{h}{3}[f''(x_0 + h) + f''(x_0 - h)] - \frac{2}{3}[f'(x_0 + h) - f'(x_0 - h)],$$

$$\phi'''(h) = -\frac{h}{3}[f'''(x_0 + h) - f'''(x_0 - h)].$$

By the theorem of mean value this last expression in brackets is equal to
$2h f^{iv}(\xi)$, where $x_0 - h < \xi < x_0 + h$. Hence

$$\phi'''(h) = -\frac{2h^2}{3} f^{iv}(\xi). \tag{2}$$

On putting h = 0 in the expressions for $\phi(h)$, $\phi'(h)$, $\phi''(h)$, we find that
$\phi(0) = \phi'(0) = \phi''(0) = 0$.

We now integrate (2) three times with respect to h, either by integrating
from 0 to h or by determining the constant of integration at each step.
In either case we make use of the relations $\phi(h) = 0$, $\phi'(h) = 0$, and
$\phi''(h) = 0$ for h = 0. Although the factor $f^{iv}(\xi)$ depends to some extent
on the magnitude of the interval (0, h), we can replace it by its mean value
in the interval and remove it from under the integral sign. Assuming that
this is done at each step, we have

$$\phi'''(h) = -\frac{2h^2}{3} f^{iv}(\xi),$$
$$\phi''(h) = -\frac{2h^3}{9} f^{iv}(\xi),$$
$$\phi'(h) = -\frac{h^4}{18} f^{iv}(\xi),$$
$$\phi(h) = -\frac{h^5}{90} f^{iv}(\xi).$$

Since this $\phi(h)$ is the inherent error for an interval of width 2h, the
inherent error for a subinterval of width h is half this amount, or
$-(h^5/180) f^{iv}(\xi)$. Then since $b - a = nh$, we have as the inherent error
for the whole interval $b - a$:

$$E_S = -\frac{nh^5}{180} f^{iv}(\xi) = -\frac{(b - a)}{180} h^4 f^{iv}(\xi), \tag{62.1}$$

where $\xi$ now lies between a and b, or $a < \xi < b$.

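2. Another way of simplifying (1) is by means of Taylor's theorem.
Expanding $F(x_0 + h)$, $F(x_0 - h)$, $f(x_0 + h)$, and $f(x_0 - h)$ by Taylor's
theorem and remembering that $F'(x_0) = f(x_0)$, $F''(x_0) = f'(x_0)$, etc., we have

$$F(x_0 \pm h) = F(x_0) \pm h f(x_0) + \frac{h^2}{2} f'(x_0) \pm \frac{h^3}{3!} f''(x_0) + \frac{h^4}{4!} f'''(x_0) \pm \frac{h^5}{5!} f^{iv}(x_0) + \cdots.$$

Also,

$$f(x_0 \pm h) = f(x_0) \pm h f'(x_0) + \frac{h^2}{2} f''(x_0) \pm \frac{h^3}{3!} f'''(x_0) + \frac{h^4}{4!} f^{iv}(x_0) \pm \cdots.$$

On substituting in (1) these values for $F(x_0 + h)$, $F(x_0 - h)$, $f(x_0 + h)$,
and $f(x_0 - h)$, we get

$$E_S = -\frac{h^5}{90}[f^{iv}(x_0) + \cdots].$$

Hence the inherent error for the whole interval $b - a$ is

$$E_S = -\frac{h^5}{90}[f^{iv}(x_1) + f^{iv}(x_3) + \cdots + f^{iv}(x_{n-1})]. \tag{3}$$

Let $f^{iv}(x_m)$ denote the greatest value of any of the n/2 quantities within
the bracket. Then, numerically,

$$E_S \le -\frac{nh^5}{180} f^{iv}(x_m) = -\frac{(b - a)}{180} h^4 f^{iv}(x_m). \tag{62.2}$$

The values found for $E_S$ show that $E_S = 0$ when $f^{iv}(x) = 0$. Hence
when f(x) is a polynomial of the first, second, or third degree, Simpson's
Rule gives the exact value of $\int_a^b f(x)\,dx$.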
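
Both conclusions are easily tested by machine. The Python sketch below is
an illustration added to the text; the names `simpson`, `cubic`, and
`quartic` are hypothetical. It applies Simpson's Rule to a cubic, for which
the error should vanish, and to $x^4$, for which the fourth derivative is
the constant 24, so that the error $-\frac{(b - a)}{180} h^4 f^{iv}$
should be reproduced exactly.

```python
def simpson(f, a, b, n):
    """Composite Simpson's Rule with n (even) equal subintervals."""
    if n % 2:
        raise ValueError("n must be even")
    h = (b - a) / n
    total = f(a) + f(b)
    total += 4 * sum(f(a + i * h) for i in range(1, n, 2))
    total += 2 * sum(f(a + i * h) for i in range(2, n, 2))
    return h * total / 3


a, b, n = 0.0, 2.0, 4
h = (b - a) / n


def cubic(x):
    return 2 * x ** 3 - x ** 2 + 3 * x - 1


def quartic(x):
    return x ** 4


# Cubic: the exact integral over (0, 2) is 28/3, and the error should vanish.
cubic_error = 28 / 3 - simpson(cubic, a, b, n)

# Quartic: f^iv = 24 everywhere, so the error formula gives the exact error.
predicted = -(b - a) / 180 * h ** 4 * 24
actual = (b ** 5 - a ** 5) / 5 - simpson(quartic, a, b, n)

print(f"cubic error   : {cubic_error:.2e}")
print(f"quartic error : {actual:.6f} (predicted {predicted:.6f})")
```

For functions whose fourth derivative varies, the formula gives only a
bound, since the point at which the derivative is to be taken is unknown.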



(b) A Formula in Terms of Differences. In many applications of
Simpson's Rule the analytical form of the function to be integrated is
either totally unknown or else is of such a nature that its fourth derivative
is difficult to calculate. In either case formula (3) cannot be applied
as it stands. We get around the difficulty by transforming it into another
form.

Let us replace the derivatives $f^{iv}(x_1)$, $f^{iv}(x_3)$, etc. by their values
in terms of differences. For this purpose we write Stirling's interpolation
formula in the form

$$y = f(x) = f(k + hu) = y_0 + u\,\frac{\Delta y_{-1} + \Delta y_0}{2} + \frac{u^2}{2}\Delta^2 y_{-1}
 + \frac{u(u^2 - 1^2)}{3!}\,\frac{\Delta^3 y_{-1} + \Delta^3 y_{-2}}{2} + \frac{u^2(u^2 - 1^2)}{4!}\Delta^4 y_{-2},$$

to fourth differences.

Differentiating this formula with respect to x by means of the formula
$dy/dx = (dy/du)(du/dx)$ and the relation $x = k + hu$, or $u = (x - k)/h$,
and then putting u = 0 in each derivative, we get

$$f^{iv}(k) = \frac{\Delta^4 y_{-2}}{h^4}.$$

Now putting $k = x_1, x_3, \cdots, x_{n-1}$, writing $f^{iv}(x_1) = \Delta^4 y_{-1}/h^4$, etc., and
substituting in (3) these values of the fourth derivatives, we get

$$E_S = -\frac{h}{90}(\Delta^4 y_{-1} + \Delta^4 y_1 + \Delta^4 y_3 + \cdots + \Delta^4 y_{n-3}). \tag{4}$$

This expression for the error in Simpson’s Rule is identical with the 
third set of terms in our central-difference quadrature formula (63.1). 
That formula is therefore Simpson’s Rule plus its correction terms, as was 
stated on page 146. 

(c). A Formula in Terms of the Given Ordinates. To get a formula 
for Eb in terms of the given ordinates, we replace the differences in (4) 
by their values in terms of the y’s as given in Art. 16, Table 3. Since 

— y» — 4y* -H 6yi — 4yo y.i, 

AVi — ys — 4y« + 6y, — 4yj -|- yi, 


A*yn-3 yn*i — 4y» -|- 6yii-i — 4yii.2 yii-3» 


we have, on substituting these in (4), 



Abt. 82] 


FORMULAS FOR ERROR IN SIMPSON’S RULE 


187 


(5) [y-i + jfH*i — 4(yo + yi») + 7(yi + jfn-t) 

— ®(y* + y4 + ’ • • + yii-*) + 8(yi + ^5 + • ■ + y«-»)] 

▼hen n ^ 6. 

If the number of subintervals be less than six, the formulas for ^5 are 
h 

(6) Eb — — ^ [y-i + Jfa — 4(yo + Jfa) + 6yi], for n — 2. 

(7) Ea^ — ^ [y-i + ys — 4(yo + y4) + 7(yi + y,) — 8y*], for n — 4. 


The ordinates $y_{-1}$ and $y_{n+1}$, which are outside the interval of integration, can be found in one or more ways. If the values of y are computed from a formula and the formula holds outside the interval of integration, then we merely compute $y_{-1}$ and $y_{n+1}$ from this formula by substituting the proper values of x. But if we are given only a tabular set of y’s we find $y_{-1}$ and $y_{n+1}$ by extrapolation, the former by using Newton’s formula (I) and the latter by using Newton’s formula (II).

(d) A Formula in Terms of Two Computed Results. Suppose two computations of a definite integral are made by Simpson’s Rule, using a different value of h for each computation. Let $R_1$, $h_1$, $E_1$ denote the result, the value of h, and the error in the first computation, and let $R_2$, $h_2$, $E_2$ denote the corresponding quantities in the second computation. Then by (62.1) or (62.2) we have

$$\frac{E_1}{E_2} = \frac{h_1^4}{h_2^4},$$

or

$$E_1 = \frac{h_1^4}{h_2^4}\,E_2.$$

Hence if $h_2 = \tfrac{1}{2}h_1$, we have

$$E_1 = 16E_2.$$

Let I denote the true value of the given integral. Then for the two computations we have

$$I = R_1 + E_1 = R_1 + 16E_2,$$
$$I = R_2 + E_2.$$

Subtracting the upper equation from the lower and solving for $E_2$, we get

(8) $E_2 = \dfrac{R_2 - R_1}{15}.$

This formula tells us that if we compute the value of a definite integral by using a certain value for h and then compute it again by using twice as many subdivisions, the error of the second result will be about 1/15th of the difference of the two results.
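Formula (8) is easily tried on a machine. The following is a minimal sketch, not part of the original text: the function name `simpson` and the choice of the integrand $\ln x$ on (4, 5.2) with $n = 6$ and $n = 12$ are our own assumptions for illustration.

```python
import math

def simpson(f, a, b, n):
    """Composite Simpson's Rule with n (even) subintervals of width h = (b - a)/n."""
    if n % 2:
        raise ValueError("n must be even")
    h = (b - a) / n
    odd = sum(f(a + i * h) for i in range(1, n, 2))
    even = sum(f(a + i * h) for i in range(2, n, 2))
    return h / 3 * (f(a) + 4 * odd + 2 * even + f(b))

a, b = 4.0, 5.2
r1 = simpson(math.log, a, b, 6)     # first computation, h1 = 0.2
r2 = simpson(math.log, a, b, 12)    # second computation, h2 = h1/2
e2_estimate = (r2 - r1) / 15        # formula (8)

# The integral of ln x is known in closed form, so the estimate can be checked.
exact = (b * math.log(b) - b) - (a * math.log(a) - a)
e2_true = exact - r2                # error in the sense I = R + E
print(e2_estimate, e2_true)
```

The two printed numbers should agree closely, because the error of Simpson’s Rule for this smooth integrand is very nearly proportional to $h^4$.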


(e) To Find the Value of h for a Stipulated Degree of Accuracy in the Result. If we wish to know the value of h to obtain a result of stipulated accuracy, we substitute in (62.1) or (62.2) the allowable error E, the maximum value of $f^{iv}(x)$, the value of $b - a$, ignore the negative sign, and solve for h.

In case $f^{iv}(x)$ cannot be found, assume a convenient value $h_1$ for h, find $E_1$ by (4) or (5), and use the relation

$$\frac{E_a}{E_1} = \frac{h^4}{h_1^4},$$

from which

(9) $h = h_1\left(\dfrac{E_a}{E_1}\right)^{1/4},$

where $E_a$ denotes the allowable error.

We shall now apply formulas (5) and (62.1) to the first example worked in Art. 60.

Example. Compute by means of (5) and (62.1) the error in the evaluation of $\int_4^{5.2} \ln x\,dx$ by Simpson’s Rule.

Solution. We must first compute $y_{-1}$ and $y_{n+1}$ from the given function $y = \ln x$. For these we have

$$y_{-1} = \ln 3.8 = 1.33500107,$$
$$y_{n+1} = \ln 5.4 = 1.68639895.$$

The values of y from $y_0$ to $y_6$ inclusive are given in the table on page 134. Substituting these y’s in (5), we get

$$E_S = -\frac{0.2}{90}\left[\,3.02140002 - 4(3.03495299) + 7(3.04452244) - 8(3.05022046) + 8(1.52605630)\,\right] = 0.00000015.$$

The true error was found in Art. 62 to be 0.00000015.
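The arithmetic of this example can be repeated mechanically. The sketch below is an illustration only; the function name `simpson_error_from_ordinates` and the convention that the list begins with $y_{-1}$ are assumptions of ours, not notation from the text.

```python
import math

def simpson_error_from_ordinates(y, h):
    """Estimate the Simpson error by formula (5).

    y = [y_{-1}, y_0, ..., y_n, y_{n+1}] with n even and n >= 6.
    """
    n = len(y) - 3
    if n < 6 or n % 2:
        raise ValueError("formula (5) needs an even n >= 6")

    def Y(i):
        return y[i + 1]          # list position of the ordinate y_i

    even_inner = sum(Y(i) for i in range(2, n - 1, 2))   # y_2 + y_4 + ... + y_{n-2}
    odd_inner = sum(Y(i) for i in range(3, n - 2, 2))    # y_3 + y_5 + ... + y_{n-3}
    bracket = (Y(-1) + Y(n + 1) - 4 * (Y(0) + Y(n)) + 7 * (Y(1) + Y(n - 1))
               - 8 * even_inner + 8 * odd_inner)
    return -h / 90 * bracket

h = 0.2
ordinates = [math.log(3.8 + h * i) for i in range(9)]   # ln 3.8, ln 4.0, ..., ln 5.4
error_estimate = simpson_error_from_ordinates(ordinates, h)
print(f"{error_estimate:.8f}")
```

Here $n = 6$, so the inner sums reduce to $y_2 + y_4$ and $y_3$, exactly as in the hand computation above.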

To compute the error by (62.1) or (62.2) we first find $f^{iv}(x)$ from the equation $f(x) = \ln x$. We thus have

$$f^{iv}(x) = -\frac{6}{x^4}.$$

Hence

$$E_S \leq \frac{(0.2)^4}{180}\,(1.2)\left(\frac{6}{4^4}\right) = 0.00000025.$$

This is of the same order of magnitude as the actual error but greater than it, as it should be.

Suppose we wished to know the value of h necessary to give the integral correct to ten decimal places. Since we have already found the error corresponding to a particular value of h, we can find the desired value by substituting in formula (9). Here

$$h_1 = 0.2, \quad E_1 = 0.00000015, \quad E_a < 0.00000000005.$$

Hence we have

$$h < 0.2\left(\frac{0.00000000005}{0.00000015}\right)^{1/4} = 0.027.$$

Since $b - a = nh$, we find that we should have to divide the interval (4, 5.2) into more than 45 subintervals in order to get a result correct to ten decimal places.
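The same substitution may be sketched in a few lines of code. The function name `step_for_allowed_error` is hypothetical; the rounding up to an even count is our own addition, made because Simpson’s Rule requires an even number of subintervals.

```python
import math

def step_for_allowed_error(h1, e1, e_allowed):
    """Formula (9): h = h1 * (E_a / E_1)**(1/4)."""
    return h1 * (e_allowed / e1) ** 0.25

h = step_for_allowed_error(0.2, 0.00000015, 0.00000000005)
n_needed = (5.2 - 4.0) / h          # from b - a = n h
n_even = math.ceil(n_needed)
if n_even % 2:
    n_even += 1                     # Simpson's Rule needs an even count
print(round(h, 3), n_even)
```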


63. The Inherent Error in Weddle’s Rule. In deriving Weddle’s Rule, we omitted the quantity $-\dfrac{h}{140}\Delta^6 y$. This omitted quantity is the principal part of the error inherent in the formula. In terms of derivatives, $\Delta^6 y = h^6 f^{vi}(x)$, approximately. Hence

(63.1) $E_W = -\dfrac{h\,\Delta^6 y}{140} = -\dfrac{h^7 f^{vi}(x)}{140}.$

This means that when f(x) is a polynomial of the 5th degree or lower, Weddle’s Rule gives an exact result.


64. The Remainder Terms in Central-Difference Formulas (53.1) and (53.3). The remainder terms in these formulas can be found by integrating the remainder terms in Stirling’s and Bessel’s interpolation formulas from which (53.1) and (53.3) were derived. Since (53.1) is at least as accurate as (53.3), and since a more definite formula can be derived for the remainder term in the latter than in the former, we shall derive the remainder term for (53.3) only and use it for computing the error in both formulas. In Art. 40 we found the remainder term in Bessel’s formula to be

$$R_n = \frac{h^{2n+2} f^{(2n+2)}(\xi)}{(2n+2)!}\left(v^2 - \frac{1}{4}\right)\left(v^2 - \frac{9}{4}\right)\cdots\left(v^2 - \frac{(2n+1)^2}{4}\right).$$

Since $f(x)\,dx$ is the quantity that is integrated by a quadrature formula, it is plain that $R_n(x)\,dx$ is the quantity which must be integrated to find the inherent error in the quadrature; and since $dx = h\,dv$, we have for the error in a single subinterval of width h

$$E = \frac{h^{2n+3} f^{(2n+2)}(\xi)}{(2n+2)!}\int_{-1/2}^{1/2}\left(v^2 - \frac{1}{4}\right)\left(v^2 - \frac{9}{4}\right)\cdots\left(v^2 - \frac{(2n+1)^2}{4}\right)dv.$$

Let us put

(2) $I_n = \displaystyle\int_0^{1/2}\left(v^2 - \frac{1}{4}\right)\left(v^2 - \frac{9}{4}\right)\cdots\left(v^2 - \frac{(2n+1)^2}{4}\right)dv.$

Then

$$E = \frac{2h^{2n+3} f^{(2n+2)}(\xi)\,I_n}{(2n+2)!}.$$

This is the error for a single subinterval of width h. Let $M_n$ denote the maximum value of $f^{(2n+2)}(x)$ in the interval (a, b). Then since there are $(b - a)/h$ subintervals from $x = a$ to $x = b$, we have for the total error in the interval (a, b)

(3) $E \leq \dfrac{2h^{2n+2} M_n}{(2n+2)!}\,(b - a)\,|I_n|.$


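From this general formula we get particular ones by assigning values to n. Thus, if we include fourth differences in (53.3) and neglect all higher differences, we put n = 2. Then (2) becomes

$$I_2 = \int_0^{1/2}\left(v^2 - \frac{1}{4}\right)\left(v^2 - \frac{9}{4}\right)\left(v^2 - \frac{25}{4}\right)dv = -\frac{191}{168},$$

and therefore (3) becomes

$$E \leq \frac{2h^6 M_2}{6!}\cdot\frac{191}{168}\,(b - a),$$

or, more simply,

(4) $E \leq \dfrac{191\,h^6 M_2}{60480}\,(b - a) < \dfrac{h^6 M_2}{316}\,(b - a).$

In terms of differences this becomes

(5) $E < \dfrac{\Delta^6 y}{316}\,(b - a),$

where $\Delta^6 y$ is the largest of the sixth differences.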



If we include sixth differences in (53.3) and neglect all higher differences, then n = 3 and (2) becomes

$$I_3 = \int_0^{1/2}\left(v^2 - \frac{1}{4}\right)\left(v^2 - \frac{9}{4}\right)\left(v^2 - \frac{25}{4}\right)\left(v^2 - \frac{49}{4}\right)dv = \frac{2497}{180}.$$

On substituting these in (3) we find

$$E \leq \frac{2h^8 M_3}{8!}\cdot\frac{2497}{180}\,(b - a),$$

or

(6) $E \leq \dfrac{2497\,h^8 M_3}{3628800}\,(b - a) < \dfrac{h^8 M_3}{1453}\,(b - a).$

In terms of differences this becomes

(7) $E < \dfrac{\Delta^8 y}{1453}\,(b - a),$

where $\Delta^8 y$ is the largest of the eighth differences in the interval (a, b).

When we stop with fourth differences in formula (53.1) or with third differences in (53.3), the error is to be computed by (5); and when we stop with sixth differences in (53.1) or with fifth differences in (53.3), the error is to be computed by (7).


65. The Inherent Errors in the Formulas of Gauss, Lobatto, and Tchebycheff. The inherent errors in the formulas of Gauss, Lobatto, and Tchebycheff are usually given in terms of the coefficients in a power series. To find the error in any given case, it is therefore necessary to expand the function $\phi(u)$ as a power series and pick out the appropriate coefficients, one or more.

For Gauss’s formula the principal part of the inherent error is *

(1) $E_G = \dfrac{b - a}{(2n+1)\,2^{2n}}\left(\dfrac{n!}{1\cdot 3\cdot 5\cdots(2n-1)}\right)^2\left[\,L_{2n} + \dfrac{L_{2n+2}}{8}\left(\dfrac{(n+1)(n+2)}{2n+3} + \dfrac{n(n-1)}{2n-1}\right)\right],$

where the L’s are the coefficients in the power series

(2) $\phi(u) = L_0 + L_1 u + L_2 u^2 + \cdots + L_m u^m + \cdots.$

When the series for $\phi(u)$ is rapidly convergent, the term involving $L_{2n+2}$ in formula (1) may be omitted.

If the analytic form of $\phi(u)$ is not known, $E_G$ cannot be determined.

* Derived in Todhunter’s Functions of Laplace, Lamé, and Bessel, p. 108.



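Example. Find $E_G$ for the example in Art. 54, p. 157.

Solution. Here

$$\phi(u) = \frac{1}{7u + 8.5} = \frac{2}{14u + 17} = \frac{2}{17}\cdot\frac{1}{1 + \frac{14}{17}u}.$$

Since n = 5, 2n + 2 = 12. We must therefore find the coefficients of $u^{10}$ and $u^{12}$ in the series for $\phi(u)$. We have

$$\phi(u) = \frac{2}{17}\left[1 - \frac{14}{17}u + \left(\frac{14}{17}\right)^2 u^2 - \cdots + \left(\frac{14}{17}\right)^{10} u^{10} - \left(\frac{14}{17}\right)^{11} u^{11} + \left(\frac{14}{17}\right)^{12} u^{12} - \cdots\right].$$

From this series we see that

$$L_{2n} = \frac{2}{17}\left(\frac{14}{17}\right)^{10} \quad\text{and}\quad L_{2n+2} = \frac{2}{17}\left(\frac{14}{17}\right)^{12}.$$

Substituting these in (1), we get

$$E_G = \frac{2}{17}\times\frac{7}{11\times 2^{10}}\left(\frac{5!}{1\cdot 3\cdot 5\cdot 7\cdot 9}\right)^2\left[\left(\frac{14}{17}\right)^{10} + \frac{1}{8}\left(\frac{14}{17}\right)^{12}\left(\frac{42}{13} + \frac{20}{9}\right)\right]$$
$$= 0.00000017 + 0.00000008 = 0.00000025.$$

This result agrees well with the actual error 0.00000028 found in Art. 54.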

There are no simple formulas in terms of n for the inherent errors in the formulas of Lobatto and Tchebycheff. Formulas for particular values of n are given in the tables at the end of the book by B. P. Moors.

66. The Remainder Term in Euler’s Formula. Malmsten’s expression for the remainder after m terms in Euler’s formula of summation and quadrature is

(1) $R_m = A_{2m}\,h^{2m+1} f^{(2m)}(a + \theta h), \quad 0 < \theta < 1,$

for a single subinterval of width h.

Let M denote the numerically greatest value of $f^{(2m)}(x)$ in the whole interval (a, b). Then for the n subintervals we have

(2) $R_m \leq nA_{2m}\,h^{2m+1} M,$

or, since $n = (b - a)/h$,

(3) $R_m \leq A_{2m}\,h^{2m} M\,(b - a).$

Here $A_{2m}$ has the following numerical values:

$$A_2 = \frac{1}{12}, \quad A_4 = \frac{1}{720}, \quad A_6 = \frac{1}{30240}, \quad A_8 = \frac{1}{1209600}, \quad A_{10} = \frac{1}{47900160}.$$


More useful, perhaps, than formula (3) is the following working rule due to Charlier: *

In stopping with any term in Euler’s formula the error committed is less than twice the first neglected term.

Hence we get the most accurate result by stopping with the term just before the smallest, so that the first neglected term is the smallest of all.

We shall now show that the first two terms of Euler’s formula will give a more accurate result than Simpson’s Rule.

Putting m = 2 in formula (3), we have

$$R_2 \leq \frac{h^4 M}{720}\,(b - a),$$

where M denotes the greatest numerical value of $f^{iv}(x)$ in the interval (a, b).

The remainder term in Simpson’s Rule is (Art. 62)

$$E_S \leq \frac{h^4 M}{180}\,(b - a).$$

Hence the inherent error in Euler’s formula for only two terms is just one fourth that in Simpson’s Rule.


EXERCISES IX

1. Compute the inherent errors in the answers to Exercise 4 of Chapter VIII and compare these errors with that found in Exercise 25 of Chapter I.

2. Compute the inherent error in the answer to Exercise 6 of Chapter VIII.

3. Estimate the accuracy of the answer to Exercise 8, Ch. VIII.


* Mechanik des Himmels, II, pp. 13-16.



CHAPTER X 


THE SOLUTION OF NUMERICAL ALGEBRAIC AND 
TRANSCENDENTAL EQUATIONS 

I. EQUATIONS IN ONE UNKNOWN 

67. Introduction. It is shown in algebra how to solve literal equations of all degrees up to and including the fourth; and it is also shown how to compute the roots of numerical equations of any degree. Algebra is silent, however, on the solution of such types of equations as $ax + b\log x = c$, $ae^{-x} + b\tan x = 5$, etc. These are transcendental equations, and no general method exists for finding their roots in terms of their coefficients. When the coefficients of such equations are pure numbers, however, it is always possible to compute the roots to any desired degree of accuracy.

The object of the present chapter is to set forth the most useful methods for finding the roots of any equation having numerical coefficients. Since Horner’s method is explained in most college algebras, and since it cannot be applied to transcendental equations, we shall not consider it here.

68. Finding Approximate Values of the Roots. In finding the real roots of a numerical equation by any method except that of Graeffe, it is necessary first to find an approximate value of the root from a graph or otherwise. Let

(1) $f(x) = 0$

denote the equation whose roots are to be found. Then if we take a set of rectangular coordinate axes and plot the graph of

(2) $y = f(x),$

it is evident that the abscissas of the points where the graph crosses the x-axis are the real roots of the given equation, for at these points y is zero and therefore (1) is satisfied. Approximate values for the real roots of any numerical equation can therefore be found from the graph of the given equation. It is not necessary, however, to draw the complete graph. Only the portions in the neighborhood of the points where it crosses the x-axis are needed.

Even more useful and important than a graph is the following fundamental theorem:

If f(x) is continuous from x = a to x = b and if f(a) and f(b) have opposite signs, then there is at least one real root between a and b.

This theorem is evident from an inspection of Fig. 7, for if f(a) and f(b) have opposite signs the graph must cross the x-axis at least once between x = a and x = b.



In most cases the approximate values of the real roots of $f(x) = 0$ are most easily found by writing the equation in the form

(3) $f_1(x) = f_2(x)$

and then plotting on the same axes the two equations

$$y_1 = f_1(x), \quad y_2 = f_2(x).$$

The abscissas of the points of intersection of these two curves are the real roots of the given equation, for at these points $y_1 = y_2$ and therefore $f_1(x) = f_2(x)$. Hence (3) is satisfied and consequently $f(x) = 0$ is likewise satisfied.

We shall now apply the foregoing methods to two examples. 

Example 1. Find approximate values for the real roots of

$$x \log_{10} x = 1.2.$$

Solution. We write the equation in the form $f(x) = x \log_{10} x - 1.2$, assign positive integral values to x, and compute the corresponding values of f(x), as shown in the table below. Since f(2) and f(3) have opposite signs, a root lies between x = 2 and x = 3, and this is the only real root.

x        1        2        3        4
f(x)    -1.2     -0.6     +0.23    +1.21

The approximate value of the root can also be found by writing the equation in the form

$$\log_{10} x = \frac{1.2}{x}$$

and then plotting the graphs of $y_1 = \log_{10} x$ and $y_2 = 1.2/x$. The abscissa of the point of intersection of these graphs is the desired root.

Example 2. Find the approximate value of the root of

$$3x - \cos x - 1 = 0.$$

Solution. Since this equation is the difference of two functions, we can write it in the form

$$3x - 1 = \cos x.$$

Then we plot separately on the same set of axes the two equations

$$y_1 = 3x - 1,$$
$$y_2 = \cos x.$$

The abscissa of the point of intersection of the graphs of these equations is seen to be about 0.6 (Fig. 8).

[Fig. 8: graphs of $y_1 = 3x - 1$ and $y_2 = \cos x$]

Of course we could also find this approximate value by computing a table of values of the function $f(x) = 3x - \cos x - 1$ and noting the change in sign of f(x), as in Ex. 1.
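The tabular search for a change of sign can be sketched in code as follows. This is only an illustration; the helper name `bracket_roots` and the particular tabulation points are our own choices. A tabulated value that happens to be exactly zero is not treated specially in this sketch.

```python
import math

def bracket_roots(f, xs):
    """Return the pairs of consecutive tabulated abscissas between which f changes sign."""
    pairs = []
    for left, right in zip(xs, xs[1:]):
        if f(left) * f(right) < 0:
            pairs.append((left, right))
    return pairs

f1 = lambda x: x * math.log10(x) - 1.2        # Example 1
f2 = lambda x: 3 * x - math.cos(x) - 1        # Example 2

brackets1 = bracket_roots(f1, [1, 2, 3, 4])
brackets2 = bracket_roots(f2, [i / 10 for i in range(11)])
print(brackets1, brackets2)
```

For Example 1 the only change of sign occurs between x = 2 and x = 3, and for Example 2 between x = 0.6 and x = 0.7, in agreement with the graphs.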

68 A. Finding Roots by Repeated Application of Location Theorem. Repeated application of the theorem of Art. 68 will give the real roots of a numerical equation to any desired degree of accuracy. Each application adds an additional correct digit to the approximate root and at the same time shows the number of correct figures in the approximate root at any stage of the computation. We illustrate the method by continuing the computation of the root of the equation

$$x \log_{10} x - 1.2 = 0,$$

which we now write in the form

$$f(x) = x \log_{10} x - 1.2 = 0.$$

x          f(x)
2         -0.60
3          0.23
2.7       -0.035
2.8        0.052
2.74      -0.00056
2.75       0.0082
2.740     -0.00056
2.741      0.00031
2.7406    -0.000040
2.7407     0.000047

The last pair of values shows that the root is about halfway between 2.7406 and 2.7407, or 2.74065.

This method of computing roots is not rapid, but it is simple and reliable and is applicable to any type of numerical equation. It is preferable to the regula falsi method treated in the next article. It is sometimes called the “pinch” method, but that name seems inappropriate.
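The process just described may be sketched as a short routine that divides the current interval into tenths and keeps the tenth in which the sign changes. The function name `refine_by_location` is an assumption made for this sketch, and floating-point arithmetic replaces the rounded hand values of the table.

```python
import math

def refine_by_location(f, a, b, digits):
    """Shrink a sign-change interval [a, b] to one tenth of its width, `digits` times."""
    for _ in range(digits):
        step = (b - a) / 10
        for i in range(10):
            left = a + i * step
            right = a + (i + 1) * step
            if f(left) * f(right) <= 0:     # the root lies in this tenth
                a, b = left, right
                break
    return a, b

f = lambda x: x * math.log10(x) - 1.2
lo, hi = refine_by_location(f, 2.0, 3.0, 4)
print(lo, hi)
```

Starting from the interval (2, 3), four applications reproduce the successive intervals of the table and end with the pair 2.7406, 2.7407.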

69. The Method of Interpolation, or of False Position (Regula Falsi). The oldest method for computing the real roots of a numerical equation is the method of false position, or “regula falsi.” In this method we find two numbers $x_1$ and $x_2$ between which the root lies. These numbers should be as close together as possible. Since the root lies between $x_1$ and $x_2$ the graph of $y = f(x)$ must cross the x-axis between $x = x_1$ and $x = x_2$, and $y_1$ and $y_2$ must have opposite signs.

Now since any portion of a smooth curve is practically straight for a short distance, it is legitimate to assume that the change in f(x) is proportional to the change in x over a short interval, as in the case of linear interpolation from logarithmic and trigonometric tables. The method of false position is based on this principle, for it assumes that the graph of $y = f(x)$ is a straight line between the points $(x_1, y_1)$ and $(x_2, y_2)$, these points being on opposite sides of the x-axis.

To derive a formula for computing the root, let Fig. 9 represent a magnified view of that part of the graph between $(x_1, y_1)$ and $(x_2, y_2)$. Then from the similar triangles PMS and PRQ we have

(1) $\dfrac{MS}{MP} = \dfrac{RQ}{RP}$, or $\dfrac{h}{|y_1|} = \dfrac{x_2 - x_1}{|y_1| + |y_2|}$, $\quad\therefore\; h = \dfrac{(x_2 - x_1)\,|y_1|}{|y_1| + |y_2|}.$

The value of the desired root, under the assumptions made, is

$$x = x_1 + MS = x_1 + h.$$

Hence

(2) $x = x_1 + \dfrac{(x_2 - x_1)\,|y_1|}{|y_1| + |y_2|}.$

This value of x is not, however, the true value of the root, because the graph of $y = f(x)$ is not a perfectly straight line between the points P and Q. It is merely a closer approximation to the true root.

In the practical application of the regula falsi method we compute a short 
table of corresponding values of x and f{x) for equidistant values of x — 
units, tenths, hundredths, etc. Then by means of (1) we compute cor- 
rections to be applied to the previously obtained approximate values. The 
following examples should make the method clear. 


Example. Compute the real root of

$$x \log_{10} x - 1.2 = 0$$

correct to five decimal places.

Solution. The short table in Example 1 of Article 68 above shows that the root lies between 2 and 3, and that it is nearer 3. Hence we make out the following table and then compute the corrections by (1).

           x          y            Correction
1st        2         -0.6          $h_1 = \dfrac{1 \times 0.6}{0.83} = 0.72$
approx.    3         +0.23         $x^{(1)} = 2 + 0.72 = 2.72.$
Diff.      1          0.83

2nd        2.7       -0.04         $h_2 = \dfrac{0.1 \times 0.04}{0.09} = 0.04$
approx.    2.8       +0.05         $x^{(2)} = 2.7 + 0.04 = 2.74.$
Diff.      0.1        0.09

3rd        2.74      -0.0006       $h_3 = \dfrac{0.01 \times 0.0006}{0.0087} = 0.0007$
approx.    2.75      +0.0081       $x^{(3)} = 2.74 + 0.0007 = 2.7407.$
Diff.      0.01       0.0087

4th        2.7406    -0.000039     $h_4 = \dfrac{0.0001 \times 0.000039}{0.000084} = 0.000046$
approx.    2.7407    +0.000045     $x^{(4)} = 2.7406 + 0.000046 = 2.74065.$
Diff.      0.0001     0.000084
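Formula (2) lends itself to a short routine. The sketch below is an assumption-laden illustration, not the tabular scheme of the example: instead of retabulating at tenths, hundredths, etc., it uses the common variant that keeps whichever sub-interval still contains the change of sign. The names `false_position_step` and `false_position` are ours.

```python
import math

def false_position_step(f, x1, x2):
    """One application of formula (2); f(x1) and f(x2) must have opposite signs."""
    y1, y2 = f(x1), f(x2)
    if y1 * y2 > 0:
        raise ValueError("root is not bracketed")
    return x1 + (x2 - x1) * abs(y1) / (abs(y1) + abs(y2))

def false_position(f, x1, x2, tol=1e-10, max_steps=100):
    """Repeat the step, keeping the sub-interval on which f changes sign."""
    x = x1
    for _ in range(max_steps):
        x = false_position_step(f, x1, x2)
        y = f(x)
        if abs(y) < tol:
            return x
        if f(x1) * y < 0:
            x2 = x
        else:
            x1 = x
    return x

f = lambda x: x * math.log10(x) - 1.2
first = false_position_step(f, 2.0, 3.0)    # compare with x^(1) = 2.72
root = false_position(f, 2.0, 3.0)
print(round(first, 2), round(root, 5))
```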


70. Solution by Repeated Plotting on a Larger Scale. The following method is the graphical equivalent of the regula falsi method and has the advantage of giving a visual representation of the approximating process.

Suppose an approximate value of the root has been found from a graph or otherwise. Plot on a large scale a small part of the graph of $y = f(x)$ for values of x near the desired root, so that one can see more clearly about where the graph crosses the x-axis. An additional figure of the root can be read from this graph. Then plot on a still larger scale a small part of the graph for values of x near the improved value of the root (the value just found), and continue the process in this manner until the root has been found to as many figures as desired. The following example should make the method clear.

Example Find the positive real root of 




Solution. We first compute the value of the left member for several values of x, as given in table (1). This table shows that a root lies between 0.5 and 0.6. Hence we plot the graph of the given equation from x = 0.5 to x = 0.6 and assume it to be a straight line within this interval. The result is Fig. 10 (a), and it shows at a glance that the root is about 0.56 or 0.57. We therefore compute table (2) and plot the results as shown in Fig. 10 (b). This graph shows that the root is about 0.579. Continuing the process in this manner by computing tables (3) and (4) and plotting the results on still larger scales as shown in Figs. 10 (c) and 10 (d), we find the desired root to be x = 0.57936 to five figures.

[Fig. 10]

This method and the regula falsi method are particularly valuable for 
finding the roots of complicated equations such as the one solved above. 

71. The Newton-Raphson Method. When the derivative of f(x) is a simple expression and easily found, the real roots of $f(x) = 0$ can be computed rapidly by a process called the Newton-Raphson method. The underlying idea of the method is due to Newton, but the method as now used is due to Raphson.*

To derive a formula for computing real roots by this method let a denote an approximate value of the desired root, and let h denote the correction which must be applied to a to give the exact value of the root, so that

$$x = a + h.$$

The equation $f(x) = 0$ then becomes

$$f(a + h) = 0.$$

Expanding this by Taylor’s theorem, we have

$$f(a + h) = f(a) + hf'(a) + \frac{h^2}{2}f''(a + \theta h), \quad 0 < \theta < 1.$$

Hence

$$f(a) + hf'(a) + \frac{h^2}{2}f''(a + \theta h) = 0.$$

Now if h is relatively small, we may neglect the term containing $h^2$ and get the simple relation

$$f(a) + h_1 f'(a) = 0,$$

from which

(1) $h_1 = -\dfrac{f(a)}{f'(a)}.$

The improved value of the root is then

(2) $a_1 = a + h_1 = a - \dfrac{f(a)}{f'(a)}.$

* See Cajori’s History of Mathematics, p. 203.

The succeeding approximations are

$$a_2 = a_1 + h_2 = a_1 - \frac{f(a_1)}{f'(a_1)},$$
$$a_3 = a_2 + h_3 = a_2 - \frac{f(a_2)}{f'(a_2)},$$
$$\cdots$$
$$a_n = a_{n-1} + h_n = a_{n-1} - \frac{f(a_{n-1})}{f'(a_{n-1})}.$$

Equation (1) is the fundamental formula in the Newton-Raphson process. It is evident from this formula that the larger the derivative $f'(x)$ the smaller is the correction which must be applied to get the correct value of the root. This means that when the graph is nearly vertical where it crosses the x-axis the correct value of the root can be found with great rapidity and very little labor. If, on the other hand, the numerical value of the derivative $f'(x)$ should be small in the neighborhood of the root, the values of h given by (1) would be large and the computation of the root by this method would be a slow process or might even fail altogether. The Newton-Raphson method should never be used when the graph of f(x) is nearly horizontal where it crosses the x-axis. The process will evidently fail if $f'(x) = 0$ in the neighborhood of the root. In such cases the regula falsi method should be used.

We shall now apply the Newton-Raphson method to two examples.


Example 1. Compute to four decimal places the real root of

$$x^2 + 4\sin x = 0.$$

Solution. Since the term $x^2$ is positive for all real values of x, it is evident that the equation will be satisfied only by a negative value of x. We find from a graph that an approximate value of the root is -1.9. Since $f(x) = x^2 + 4\sin x$ and $f'(x) = 2x + 4\cos x$, we have from (1)

$$h_1 = -\frac{(-1.9)^2 + 4\sin(-1.9)}{2(-1.9) + 4\cos(-1.9)} = -\frac{3.61 - 3.78}{-3.8 - 1.293} = -0.03.$$

$$\therefore\; a_1 = -1.9 - 0.03 = -1.93.$$

$$h_2 = -\frac{(-1.93)^2 + 4\sin(-1.93)}{2(-1.93) + 4\cos(-1.93)} = -\frac{-0.0198}{-5.266} = -0.0038.$$

$$\therefore\; a_2 = -1.93 - 0.0038 = -1.9338.$$

This result is correct to its last figure, as will be shown later.



Example 2. Find by the Newton-Raphson method the real root of

$$3x - \cos x - 1 = 0.$$

Solution. Here

$$f(x) = 3x - \cos x - 1,$$
$$f'(x) = 3 + \sin x.$$

We found graphically (Fig. 8) that the approximate value of the root is 0.61. Hence

$$h_1 = -\frac{3(0.61) - \cos(0.61) - 1}{3 + \sin(0.61)} = -\frac{0.010}{3.57} = -0.00290.$$

$$\therefore\; a_1 = 0.61 - 0.0029 = 0.6071.$$

$$h_2 = -\frac{3(0.6071) - \cos(0.6071) - 1}{3 + \sin(0.6071)} = 0.00000165.$$

$$\therefore\; a_2 = 0.60710165.$$

This result also is true to its last figure.

It will be observed that the root was obtained to a higher degree of 
accuracy and with less labor by this method than by the regula falsi 
method. 
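The repeated use of formula (1) may be sketched as follows. The routine name `newton_raphson`, the stopping tolerance, and the limit on the number of steps are our own assumptions; the derivative is supplied by the user, as in the hand computations above.

```python
import math

def newton_raphson(f, fprime, a, tol=1e-10, max_steps=50):
    """Apply a_n = a_{n-1} - f(a_{n-1})/f'(a_{n-1}) until the correction is below tol."""
    for _ in range(max_steps):
        d = fprime(a)
        if d == 0:
            raise ZeroDivisionError("derivative vanished; the method fails here")
        h = -f(a) / d          # formula (1)
        a += h                 # formula (2)
        if abs(h) < tol:
            return a
    raise RuntimeError("did not converge")

# Example 1: x^2 + 4 sin x = 0, starting from -1.9
root1 = newton_raphson(lambda x: x**2 + 4 * math.sin(x),
                       lambda x: 2 * x + 4 * math.cos(x), -1.9)
# Example 2: 3x - cos x - 1 = 0, starting from 0.61
root2 = newton_raphson(lambda x: 3 * x - math.cos(x) - 1,
                       lambda x: 3 + math.sin(x), 0.61)
print(round(root1, 4), round(root2, 7))
```

Only a few corrections are needed in either example, which illustrates the rapidity of the process when the derivative is not small near the root.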

72. Geometric Significance of the Newton-Raphson Method. The regula falsi method assumes that the graph of the given function is replaced by the chord joining $(x_1, y_1)$ and $(x_2, y_2)$. No such geometric assumption was made in deriving the formula for computing the roots by the Newton-Raphson method, but the formula has a simple geometric significance nevertheless.

Let Fig. 11 represent a magnified view of the graph of $y = f(x)$ where it crosses the x-axis. Suppose we draw a tangent from the point P whose abscissa is a. This tangent will intersect the x-axis in some point T. Then let us draw another tangent from $P_1$ whose abscissa is OT. This tangent will meet the x-axis in some point $T_1$ between T and S. Then we may draw a third tangent from $P_2$ whose abscissa is $OT_1$, this tangent cutting the x-axis at a point between $T_1$ and S, and so on. It is evident intuitionally that if the curvature of the graph does not change sign between P and S the points $T, T_1, T_2, \cdots$ will approach the point S as a limit; that is, the intercepts $OT, OT_1, OT_2, \cdots$ will approach the intercept OS as a limit. But OS represents the real root of the equation whose graph is drawn. Hence the quantities $OT, OT_1, OT_2, \cdots$ are successive approximations to the desired root. This is the geometric significance of the Newton-Raphson process.


[Fig. 11]

To derive the fundamental formula from this figure let $MT = h_1$, $TT_1 = h_2$, etc. The slope of the graph at P is $f'(a)$. But from the figure we have

$$PM = f(a), \quad\text{and slope at } P = \tan\angle XTP = -\frac{PM}{h_1}.$$

Hence

$$f'(a) = -\frac{f(a)}{h_1},$$

or

$$h_1 = -\frac{f(a)}{f'(a)},$$

which is the fundamental formula of the Newton-Raphson method. From the triangle $P_1TT_1$ we find in exactly the same way

$$h_2 = -\frac{f(a_1)}{f'(a_1)}.$$

From the preceding discussion it is evident that in the Newton-Raphson method the graph of the given function is replaced by a tangent at each successive step in the approximation process.



The Newton-Raphson method should not be used when either $f'(x) = 0$ or $f''(x) = 0$ near the desired root. Use the method of enlarged graphs (Art. 70) in such cases.

73. The Inherent Error in the Newton-Raphson Method. If a is an approximate value of a root of $f(x) = 0$ and h is the necessary correction, so that $f(a + h) = 0$, then we have by Art. 71

(1) $f(a) + hf'(a) + \dfrac{h^2}{2}f''(a + \theta h) = 0, \quad 0 < \theta < 1.$

In the Newton-Raphson method we neglected the term involving $h^2$ and got an approximate value $h_1$ from the equation

(2) $f(a) + h_1 f'(a) = 0.$

Subtracting (2) from (1), we have

$$(h - h_1)f'(a) + \frac{h^2}{2}f''(a + \theta h) = 0.$$

(3) $\therefore\; h - h_1 = -h^2\,\dfrac{f''(a + \theta h)}{2f'(a)}.$

Now since h is the true value of the required correction and $h_1$ is its approximate value, it is plain that $h - h_1$ is the error in $h_1$. The error in $h_1$ is thus given by (3). Let M denote the maximum value of $f''(x)$ in the neighborhood of $a + h_1$. Then

(4) $h - h_1 = -\dfrac{h^2 M}{2f'(a)}.$

Our next problem is to express this error in terms of the known quantity $h_1$.

Clearing (4) of fractions and transposing, we have

$$Mh^2 + 2f'(a)h = 2f'(a)h_1.$$

$$\therefore\; h = \frac{-f'(a) + \sqrt{[f'(a)]^2 + 2Mf'(a)h_1}}{M} = \frac{1}{M}\left[-f'(a) + f'(a)\left(1 + \frac{2Mh_1}{f'(a)}\right)^{1/2}\right].$$

Now expanding the quantity $[1 + 2Mh_1/f'(a)]^{1/2}$ by the binomial theorem, we have

$$h = \frac{f'(a)}{M}\left[-1 + 1 + \frac{Mh_1}{f'(a)} - \frac{M^2h_1^2}{2[f'(a)]^2} + \frac{M^3h_1^3}{2[f'(a)]^3} - \cdots\right]$$
$$= h_1 - \frac{Mh_1^2}{2f'(a)} + \frac{M^2h_1^3}{2[f'(a)]^2} - \cdots.$$

Hence

(5) Error $= h - h_1 = -\dfrac{Mh_1^2}{2f'(a)} + \dfrac{M^2h_1^3}{2[f'(a)]^2}.$

Since $h_1$ is always a small decimal, it is evident that the principal part of the error is contained in the first term on the right-hand side of (5), so that we may neglect the term involving $h_1^3$. The formula for the error thus reduces to

(6) $E_1 = -\dfrac{Mh_1^2}{2f'(a)}.$

This is the error in $a_1$. The error in $a_n$ is therefore

(7) $E_n = -\dfrac{Mh_n^2}{2f'(a_{n-1})}.$

Now in most equations which one would solve by the Newton-Raphson method the quantity $M/2f'(a)$ is not greater than 1. Suppose, therefore, that $|M/2f'(a_{n-1})| \leq 1$. Then (7) reduces to

(8) $|E_n| \leq h_n^2.$

This result is most important; for it tells us that if $h_n$ begins with m zeros when expressed as a decimal fraction, then $h_n^2$ begins with 2m zeros. This means that when the first significant figure in h is less than 7, we may safely carry the division of $f(a_{n-1})/f'(a_{n-1})$ to 2m decimal places; for the error in the quotient will be less than half a unit in the 2mth decimal place. Stated otherwise, the number of reliable significant figures in h is equal to the number of zeros between the decimal point and first significant figure, provided the number of reliable figures in both $f(a_{n-1})$ and $f'(a_{n-1})$ is as great as the number of zeros preceding the first significant figure in h.

Hence in finding the correction h from the relation $h = -f(a)/f'(a)$, the divisions of f(a) by $f'(a)$ should be carried out to only one more significant figure than the number of zeros between the decimal point and first significant figure.

We thus have a simple method for determining the accuracy of the roots found by the Newton-Raphson method, and this fact makes this method much superior to the regula falsi method when the root is desired to several decimal places.

It is now clear why we were able to say in Exs. 1 and 2 of Art. 71 that 
the results obtained were true to the last figure in each case. 
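The bound $|E_n| \leq h_n^2$ of formula (8) can be observed numerically on Ex. 1 of Art. 71, for which $|M/2f'(a)|$ is less than 1 near the root. The following is a rough sketch under our own assumptions: the reference root is obtained simply by iterating well past convergence, and only the first three corrections are examined, because later ones fall below the precision of ordinary floating-point arithmetic.

```python
import math

f = lambda x: x**2 + 4 * math.sin(x)
fp = lambda x: 2 * x + 4 * math.cos(x)

# Reference value of the root: iterate well past convergence.
r = -1.9
for _ in range(20):
    r -= f(r) / fp(r)

a = -1.9
rows = []
for n in range(1, 4):
    h = -f(a) / fp(a)                        # correction h_n
    a += h                                   # approximation a_n
    rows.append((n, h, abs(r - a), h * h))   # step, h_n, actual error, bound h_n^2

for n, h, err, bound in rows:
    print(n, f"{h:.3e}", f"{err:.3e}", f"{bound:.3e}")
```

In each printed row the actual error should not exceed the square of the correction just applied.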

74. A Special Procedure for Algebraic Equations. Many algebraic equations can be solved by first removing some of the known real roots by synthetic division and then solving the resulting depressed equation by the easiest method available. For example, if the depressed equation is a quadratic, it can always be solved by the quadratic formula. The following example illustrates the method.

Example. Find all the roots of

$$x^4 - 26x^2 + 49x - 25 = 0.$$

Solution. Since the graph of $y = f(x) = x^4 - 26x^2 + 49x - 25$ crosses the y-axis at (0, -25), it is plain that the given equation has at least two real roots. By assigning integral values to x and computing the corresponding values of f(x), we find that these roots are near -6 and 4, respectively. They are found more accurately by the Newton-Raphson method to be -5.916 and 3.876. Now removing these roots from the given equation by synthetic division, we have

1     0         -26        49        -25        (-5.916
     -5.916     34.999    -53.238     25.072
1    -5.916      8.999     -4.238               (3.876
      3.876     -7.907      4.233
1    -2.040      1.092

Neglecting the remainder terms in each division, we have

$$x^2 - 2.040x + 1.092 = 0$$

for the depressed equation. Solving this by the quadratic formula, we get

$$x = 1.020 \pm 0.227i$$

as the remaining roots. As a check on the computation, we have

Sum of roots = 0.000,

Product of roots = -25.038,

which is a satisfactory check.
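The two synthetic divisions and the solution of the depressed quadratic may be sketched as follows. The helper name `synthetic_division` is hypothetical, and the code carries full floating-point precision rather than the three decimals of the hand work, so the remainders differ slightly from those implied by the table.

```python
import cmath

def synthetic_division(coeffs, r):
    """Divide a polynomial (coefficients in descending powers) by (x - r).

    Returns (quotient coefficients, remainder).
    """
    row = [coeffs[0]]
    for c in coeffs[1:]:
        row.append(c + r * row[-1])
    return row[:-1], row[-1]

quartic = [1, 0, -26, 49, -25]
cubic, rem1 = synthetic_division(quartic, -5.916)
quadratic, rem2 = synthetic_division(cubic, 3.876)

# Quadratic formula applied to the depressed equation (remainders neglected).
qa, qb, qc = quadratic
z = (-qb + cmath.sqrt(qb * qb - 4 * qa * qc)) / (2 * qa)
print([round(c, 3) for c in quadratic], round(z.real, 3), round(z.imag, 3))
```

The small remainders `rem1` and `rem2` show how nearly the assumed roots satisfy the equation.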




It is sometimes desirable to know the nature of the roots of a cubic or quartic equation before attempting to find them. For a thorough discussion of the nature of the roots of these equations, see Burnside and Panton’s Theory of Equations, Vol. 1.

75. The Method of Iteration. When a numerical equation $f(x) = 0$ can be expressed in the form

(1) $x = \phi(x),$

the real roots can be found by the process of iteration. This is the method which was used for inverse interpolation in Art. 84. The process is this: We find from a graph or otherwise an approximate value $x_0$ of the desired root. We then substitute this in the right-hand member of (1) and get a better approximation $x^{(1)}$ given by the equation

$$x^{(1)} = \phi(x_0).$$

Then the succeeding approximations are

$$x^{(2)} = \phi(x^{(1)}), \quad x^{(3)} = \phi(x^{(2)}), \quad \cdots, \quad x^{(n)} = \phi(x^{(n-1)}).$$

We shall apply the process to two examples.

Example 1. Find by the method of iteration a real root of

$$2x - \log_{10} x = 7.$$

Solution. The given equation can be written in the form

$$x = \tfrac{1}{2}(\log_{10} x + 7).$$

We find from the intersection of the graphs $y_1 = 2x - 7$ and $y_2 = \log_{10} x$ that an approximate value of the root is 3.8. Hence we have

$$x^{(1)} = \tfrac{1}{2}(\log 3.8 + 7) = 3.79,$$
$$x^{(2)} = \tfrac{1}{2}(\log 3.79 + 7) = 3.7893,$$
$$x^{(3)} = \tfrac{1}{2}(\log 3.7893 + 7) = 3.7893.$$

Since $x^{(3)}$ is the same as $x^{(2)}$, we do not repeat the process but take 3.7893 as the correct result to five figures. The iteration process is the shortest and easiest method for working this example.
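The iteration of Example 1 may be sketched in code as below. The routine name `iterate`, the tolerance, and the step limit are assumptions of this sketch; the stopping test simply asks that two successive approximations agree, as in the hand computation.

```python
import math

def iterate(phi, x0, tol=1e-8, max_steps=200):
    """Repeat x = phi(x) until two successive approximations agree within tol."""
    x = x0
    for k in range(1, max_steps + 1):
        x_new = phi(x)
        if abs(x_new - x) < tol:
            return x_new, k
        x = x_new
    raise RuntimeError("iteration did not converge")

root, steps = iterate(lambda x: 0.5 * (math.log10(x) + 7), 3.8)
print(round(root, 4), steps)
```

The convergence is rapid here because the derivative of the right-hand member is small near the root.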

Example t. The method of iteration is especially useful for finding 
the real roots of an equation given in the form of an infinite series. To 



An. 751 


THE BIETHOD OF ITERATION 


SM 

find an exprewicn for the probable error (see Art. 155) of a single 
measurement of a set, one procedure is to find the real root of the following 
equation (see page 495) : 


or 

(*) 


aa* 

£-4. 

8 ^ 10 48 ^ 




*• *• 

.£- 4 . £_ 

3 10 ^ 42 


+ 




216 ^ 1820 


+ 0.4481135. 


We shall now find the value of $\rho$ to six decimal places.

Solution. Neglecting all powers of $\rho$ higher than the first, we find an
approximate value of $\rho$ to be 0.44. Hence we start with this value and
substitute it in the right-hand member of (a). The result is


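    $\rho^{(1)} = \dfrac{(0.44)^3}{3} - \dfrac{(0.44)^5}{10} + \dfrac{(0.44)^7}{42} - \dfrac{(0.44)^9}{216} + \dfrac{(0.44)^{11}}{1320} + 0.4431$

    $\phantom{\rho^{(1)}} = 0.4699 = 0.47$, say.

Then the second approximation is

    $\rho^{(2)} = \dfrac{(0.47)^3}{3} - \dfrac{(0.47)^5}{10} + \dfrac{(0.47)^7}{42} - \dfrac{(0.47)^9}{216} + \dfrac{(0.47)^{11}}{1320} + 0.44311$

    $\phantom{\rho^{(2)}} = 0.47554 = 0.476$, say.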


Writing (a) in the form

    $\rho = \phi(\rho)$,

we find the succeeding approximations to be

    $\rho^{(3)} = \phi(0.476) = 0.4767$,
    $\rho^{(4)} = \phi(0.4767) = 0.47689$,
    $\rho^{(5)} = \phi(0.47689) = 0.476927$,
    $\rho^{(6)} = \phi(0.476927) = 0.476934$,
    $\rho^{(7)} = \phi(0.476934) = 0.476936$.

This last value is correct to its last figure.*

The reader will observe that the iteration process converges slowly in
this example. This is due to the nature of the given equation. In Ex. 1 the
convergence was rapid.
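
The slow convergence in Example 2 can be seen by carrying the same substitutions further by machine. The sketch below assumes the series is cut off after the $\rho^{11}$ term, as in the hand computation; the names `phi`, `CONSTANT`, and `rho_history` are illustrative, and full machine precision is kept at each step instead of the rounding used in the text.

```python
import math

CONSTANT = 0.4431135  # constant term of equation (a)

def phi(rho, terms=5):
    """Right-hand member of (a), using the series through rho**11."""
    total = CONSTANT
    for k in range(1, terms + 1):
        power = 2 * k + 1
        # In the original series the term is (-1)**k * rho**power / (power * k!);
        # moving it to the right-hand side reverses its sign.
        total += (-1) ** (k + 1) * rho ** power / (power * math.factorial(k))
    return total

rho = 0.44
rho_history = [rho]
for _ in range(25):
    rho = phi(rho)
    rho_history.append(rho)

print(round(rho_history[1], 4), round(rho, 6))
```

Each pass reduces the error by only a modest factor, which is why several substitutions are needed here while Example 1 settled almost at once.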

Note. Usually there are two or more ways in which an equation $f(x) = 0$
can be written in the form $x = \phi(x)$. It is not a matter of indifference
as to which way it is written before starting the iteration process, for in
some forms the process will not converge at all. An example of this is
given in Art. 81.

* The value of $\rho$ correct to ten decimal places is 0.4769362762.

76. Geometry of the Iteration Process. It is instructive to look at
the geometric picture of the iteration process. For simplicity we denote
the successive approximations to the root by $x_0, x_1, x_2, x_3, \ldots, x_n$. Then
the relations

    $x_1 = \phi(x_0)$,
    $x_2 = \phi(x_1)$,
    $x_3 = \phi(x_2)$, etc.,

can be pictured as points by the following geometric construction:

Draw the graphs of $y_1 = x$ and $y_2 = \phi(x)$, as shown in Figure 12. Since
$|\phi'(x)| < 1$ for convergence, the inclination of the curve $y_2 = \phi(x)$ must
be less than 45° in the neighborhood of $x_0$. This fact has been observed in
constructing the graph.



[Figure 12. Graphs of $y_1 = x$ and $y_2 = \phi(x)$, with the points $P_0, Q_1, P_1, Q_2, \ldots$ of the construction.]

Now to trace the convergence of the iteration process, draw the ordinate
$\phi(x_0)$. Then from the point $P_0$ draw a line parallel to OX until it
intersects the line $y_1 = x$ at the point $Q_1(x_1, \phi(x_0))$. Note that this point $Q_1$
is the geometric representation of the first iteration equation $x_1 = \phi(x_0)$.
Then draw $Q_1P_1$, $P_1Q_2$, $Q_2P_2$, $P_2Q_3$, etc., as indicated by the arrows in the
figure. The points $Q_1, Q_2, Q_3, \ldots$ thus approach the point of intersection
of the curves $y_1 = x$ and $y_2 = \phi(x)$ as the iteration proceeds. Note that
the coordinates of these Q's satisfy the corresponding iteration equations.

The reader should draw a curve $y_2 = \phi(x)$ with inclination greater than
45° in the neighborhood of $x_0$ and then proceed with the construction as
outlined above. He will find that the points $Q_1$, $Q_2$, etc. recede farther and
farther away from the intersection point of the graphs and that the
successive approximations $x_1$, $x_2$, etc. get worse as the iteration proceeds.

77. Convergence of the Iteration Process. We shall now determine 
the condition under which the iteration process converges. The true value 
of the root satisfies the equation 

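    $x = \phi(x)$,

and the first approximation satisfies

    $x^{(1)} = \phi(x_0)$.

Subtracting this equation from the preceding, we have

(1)    $x - x^{(1)} = \phi(x) - \phi(x_0)$.

By the theorem of mean value the right-hand member of (1) can be
written

    $\phi(x) - \phi(x_0) = (x - x_0)\,\phi'(\xi_0), \qquad x_0 \le \xi_0 \le x$.

Hence (1) becomes

    $x - x^{(1)} = (x - x_0)\,\phi'(\xi_0)$.

A similar equation holds for all succeeding approximations, so that

    $x - x^{(2)} = (x - x^{(1)})\,\phi'(\xi_1)$,

    $x - x^{(3)} = (x - x^{(2)})\,\phi'(\xi_2)$,

    . . . . . . . . . .

    $x - x^{(n)} = (x - x^{(n-1)})\,\phi'(\xi_{n-1})$.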



Multiplying together all these equations, member by member, and dividing
the result through by the common factors $x - x^{(1)}, x - x^{(2)}, \ldots, x - x^{(n-1)}$,
we get

(2)    $x - x^{(n)} = (x - x_0)\,\phi'(\xi_0)\,\phi'(\xi_1) \cdots \phi'(\xi_{n-1})$.

Now if the maximum absolute value of $\phi'(x)$ is less than 1 throughout the
interval $(x_0, x)$, so that each of the quantities $|\phi'(\xi_0)|$, $|\phi'(\xi_1)|$, etc. is not
greater than a proper fraction $m$, we get from (2)

(3)    $|x - x^{(n)}| \le |x - x_0|\, m^n$.

Since the right-hand member of (3) approaches zero as $n$ becomes large,
we can make the error $x - x^{(n)}$ as small as we please by repeating the
iteration process a sufficient number of times.

The condition, then, for convergence is that $|\phi'(x)|$ be less than 1 in the
neighborhood of the desired root, the smaller the value of $|\phi'(x)|$ the more
rapid the convergence. This condition was satisfied in Examples 1 and
2 of Art. 75.
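
As a numerical illustration of this condition, the sketch below estimates $|\phi'(x)|$ near the root for the two examples of Art. 75 and uses inequality (3) to estimate how many substitutions are needed. The central-difference step, the function names, and the truncation of the series after $\rho^{11}$ are assumptions made for the illustration.

```python
import math

def derivative(phi, x, h=1e-6):
    """Central-difference estimate of phi'(x); the step h is an assumption."""
    return (phi(x + h) - phi(x - h)) / (2.0 * h)

def phi_example_1(x):
    return 0.5 * (math.log10(x) + 7.0)

def phi_example_2(rho):
    # Series of Example 2 cut off after the rho**11 term.
    return (0.4431135 + rho**3 / 3 - rho**5 / 10 + rho**7 / 42
            - rho**9 / 216 + rho**11 / 1320)

m1 = abs(derivative(phi_example_1, 3.7893))
m2 = abs(derivative(phi_example_2, 0.476936))
print(round(m1, 4), round(m2, 4))

def steps_needed(m, initial_error, target_error):
    """Smallest n with initial_error * m**n <= target_error, from inequality (3)."""
    return math.ceil(math.log(target_error / initial_error) / math.log(m))

print(steps_needed(m1, 0.01, 1e-6), steps_needed(m2, 0.01, 1e-6))
```

Both values are proper fractions, and the smaller one belongs to Example 1, in agreement with the rapid convergence observed there.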


78. Convergence of the Newton-Raphson Method. The Newton-Raphson
formula

(1)    $a_{n+1} = a_n - \dfrac{f(a_n)}{f'(a_n)}$

shows that the Newton-Raphson method is really an iteration method;
and since (1) can be written symbolically in the form

    $a_{n+1} = \phi(a_n)$,

or

    $x = \phi(x)$,

we see at once from Art. 77 that the Newton-Raphson method converges
when

    $|\phi'(x)| < 1$.

Hence from (1) we have

    $\phi'(a_n) = 1 - \dfrac{[f'(a_n)]^2 - f(a_n) f''(a_n)}{[f'(a_n)]^2} = \dfrac{f(a_n) f''(a_n)}{[f'(a_n)]^2}$.

The sufficient condition for convergence is therefore

    $\left| \dfrac{f(a_n) f''(a_n)}{[f'(a_n)]^2} \right| < 1$,

or

(2)    $|f(a_n) f''(a_n)| < [f'(a_n)]^2$.
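
Condition (2) is easy to test numerically before starting a Newton-Raphson computation. The sketch below applies it to the equation of Example 1, Art. 75, written as $f(x) = 2x - \log_{10} x - 7$; the choice of this equation, the starting value 3.8, and the function names are illustrative assumptions.

```python
import math

M = math.log10(math.e)  # modulus of common logarithms, about 0.43429

def f(x):
    return 2.0 * x - math.log10(x) - 7.0

def df(x):
    return 2.0 - M / x

def d2f(x):
    return M / x**2

def condition_holds(a):
    """Inequality (2): |f(a) f''(a)| < [f'(a)]**2."""
    return abs(f(a) * d2f(a)) < df(a) ** 2

a = 3.8
ok_at_start = condition_holds(a)
for _ in range(5):
    a = a - f(a) / df(a)  # formula (1)
newton_root = a

print(ok_at_start, round(newton_root, 4))
```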

79. Errors in the Roots due to Errors in the Coefficients and Constant
Term. If the coefficients and constant term in an equation are
rounded numbers and therefore correct to only a certain number of figures,
the computed roots will be affected by errors due to the inaccuracy of the
coefficients and constant term. We now consider such errors in algebraic
equations and in transcendental equations.

a) Algebraic Equations. Assume that the a's in the equation

(1)    $f(x) = a_0x^n + a_1x^{n-1} + a_2x^{n-2} + \cdots + a_{n-1}x + a_n = 0$

are subject to the errors $\Delta a_0, \Delta a_1, \Delta a_2, \ldots, \Delta a_n$. Then each root will be
subject to an error $\Delta x$.

If the exact values of the a's are $a_0 + \Delta a_0, a_1 + \Delta a_1, \ldots, a_n + \Delta a_n$
and the exact value of any root is $x + \Delta x$, then by (1):

    $(a_0 + \Delta a_0)(x + \Delta x)^n + (a_1 + \Delta a_1)(x + \Delta x)^{n-1} + (a_2 + \Delta a_2)(x + \Delta x)^{n-2}$
        $+ \cdots + (a_{n-1} + \Delta a_{n-1})(x + \Delta x) + a_n + \Delta a_n = 0$.

Expanding the powers of $x + \Delta x$ and neglecting all terms containing
products, squares, and higher powers of the errors, we have

(2)    $a_0x^n + a_1x^{n-1} + a_2x^{n-2} + \cdots + a_{n-1}x + a_n + a_0 n x^{n-1}\Delta x + x^n \Delta a_0$
        $+ a_1(n-1)x^{n-2}\Delta x + x^{n-1}\Delta a_1 + a_2(n-2)x^{n-3}\Delta x + x^{n-2}\Delta a_2 + \cdots$
        $+ a_{n-1}\Delta x + x\,\Delta a_{n-1} + \Delta a_n = 0$.

Subtracting (1) from (2), we have

(3)    $[a_0 n x^{n-1} + a_1(n-1)x^{n-2} + a_2(n-2)x^{n-3} + \cdots + a_{n-1}]\,\Delta x$
        $= -(x^n \Delta a_0 + x^{n-1}\Delta a_1 + x^{n-2}\Delta a_2 + \cdots + x\,\Delta a_{n-1} + \Delta a_n)$.

Hence

(4)    $\Delta x = -\dfrac{x^n \Delta a_0 + x^{n-1}\Delta a_1 + \cdots + x\,\Delta a_{n-1} + \Delta a_n}{a_0 n x^{n-1} + a_1(n-1)x^{n-2} + a_2(n-2)x^{n-3} + \cdots + a_{n-1}}$

    $\phantom{\Delta x} = -\dfrac{x^n \Delta a_0 + x^{n-1}\Delta a_1 + x^{n-2}\Delta a_2 + \cdots + x\,\Delta a_{n-1} + \Delta a_n}{f'(x)}$.

The maximum numerical value of $\Delta x$ is obtained by taking the maximum
numerical value of the numerator of (4). Since the signs of the errors
$\Delta a_0, \Delta a_1, \ldots, \Delta a_n$ are not known, we may give these errors such signs as
will make all terms in the numerator positive. Then we have




(5)    $|\Delta x| \le \dfrac{|x^n \Delta a_0| + |x^{n-1}\Delta a_1| + |x^{n-2}\Delta a_2| + \cdots + |x\,\Delta a_{n-1}| + |\Delta a_n|}{|f'(x)|}$.

If $\eta$ represents the greatest numerical value that any of the errors $\Delta a_0$, $\Delta a_1$,
etc. can have, then

(6)    $|\Delta x| \le \dfrac{[\,|x^n| + |x^{n-1}| + |x^{n-2}| + \cdots + |x| + 1\,]\,\eta}{|f'(x)|}$.

The numerator of (6) can be simplified by means of the remainder
theorem of algebra. That theorem shows that the function $x^{n+1} - 1$ is
exactly divisible by $x - 1$, so that

    $\dfrac{x^{n+1} - 1}{x - 1} = x^n + x^{n-1} + \cdots + x + 1$.

Applying this result to the numerator of (6), we get

(7)    $|\Delta x| \le \dfrac{|x|^{n+1} - 1}{|x| - 1} \cdot \dfrac{\eta}{|f'(x)|}$.

This formula gives an upper bound for the error in each root of (1) when
the appropriate value of $x$ is substituted therein.

It is to be noted that (3) is merely the differential of (1) when the
a's and $x$ are all treated as variables.


Example. In the equation

    $x^3 - 1.842x^2 - 3.740x + 1.357 = 0$

the coefficients and constant term are rounded numbers and correct to three
decimal places. The roots to three decimal places are

    $x_1 = 0.321, \quad x_2 = -1.432, \quad x_3 = 2.953$.

The upper bound of the error in each root is required.

Solution. Here the value of $\eta$ is 0.0005, and

    $f'(x) = 3x^2 - 3.684x - 3.740$.

Then by (7)

    $\Delta x_1 \le \dfrac{(0.321)^4 - 1}{0.321 - 1} \times \dfrac{0.0005}{4.61} = 0.000158$,

    $\Delta x_2 \le \dfrac{(1.432)^4 - 1}{1.432 - 1} \times \dfrac{0.0005}{7.69} = 0.00048$,

    $\Delta x_3 \le \dfrac{(2.953)^4 - 1}{2.953 - 1} \times \dfrac{0.0005}{11.54} = 0.00166$.

Hence the roots may be taken to be

    $x_1 = 0.321, \quad x_2 = -1.432, \quad x_3 = 2.95$.
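
The three bounds of this example can be recomputed directly from formula (7). The sketch below is illustrative only: the function names are assumptions, and the derivative is entered from the expression for $f'(x)$ given in the solution.

```python
def f_prime(x):
    # Derivative of the cubic of the example, as given in the solution.
    return 3.0 * x**2 - 3.684 * x - 3.740

def root_error_bound(root, degree, eta, slope):
    """Formula (7): (|x|**(n+1) - 1) / (|x| - 1) * eta / |f'(x)|.

    Not usable as written when |x| = 1, where the sum in (6) equals n + 1.
    """
    ax = abs(root)
    return (ax ** (degree + 1) - 1.0) / (ax - 1.0) * eta / abs(slope(root))

bounds = [root_error_bound(r, 3, 0.0005, f_prime) for r in (0.321, -1.432, 2.953)]
for b in bounds:
    print(f"{b:.6f}")
```

The largest root carries the largest bound because both the factor $(|x|^{n+1} - 1)/(|x| - 1)$ and the powers of $x$ multiplying the coefficient errors grow with $|x|$.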

b) Transcendental Equations. As formula (7) does not apply to
transcendental equations, such equations must be investigated individually. For
example, if in the equation

(8)    $a \ln x + b \sin x = c$

$a$, $b$, $c$, and $x$ are all subject to error, we find by differentiation

    $a\,\dfrac{dx}{x} + \ln x\, da + b \cos x\, dx + \sin x\, db = dc$,

from which

    $dx = \dfrac{dc - \ln x\, da - \sin x\, db}{\dfrac{a}{x} + b \cos x}$.

If $\eta$ is the upper limit of the errors $da$, $db$, $dc$, we have

(9)    $|dx| \le \dfrac{(1 + |\ln x| + |\sin x|)\,\eta}{\left| \dfrac{a}{x} + b \cos x \right|}$.

Example. If in (8) $a = 3.21$, $b = -0.35$, $c = 1.57$ and these are
rounded numbers correct to the number of decimals given, and $x = 1.813$,
let us find $dx$.

Solution. Since $\eta = 0.005$, we have by (9)

    $dx \le \dfrac{(1 + 0.5950 + 0.9708)(0.005)}{\dfrac{3.21}{1.813} + (-0.35)(-0.2398)} = 0.0069$.

Other transcendental equations can be treated in the same manner.
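
A short numerical check of formula (9) for this example follows. It is a sketch only; the function name `dx_bound` is an assumption, and the angle is taken in radians.

```python
import math

def dx_bound(a, b, x, eta):
    """Formula (9) for a*ln(x) + b*sin(x) = c, with x in radians."""
    numerator = (1.0 + abs(math.log(x)) + abs(math.sin(x))) * eta
    denominator = abs(a / x + b * math.cos(x))
    return numerator / denominator

bound = dx_bound(3.21, -0.35, 1.813, 0.005)
print(round(bound, 4))
```

Since 1.813 radians lies in the second quadrant, the cosine is negative, and the product $b \cos x$ is positive for the negative value of $b$ used here.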

II. SIMULTANEOUS EQUATIONS IN SEVERAL UNKNOWNS 

The real roots of simultaneous algebraic and transcendental equations in 
several unknowns can be found either by the Newton-Raphson method or 
by the method of iteration. We shall give an outline of each method for 
the cases of two unknowns and three unknowns. The reader will have no 
difficulty in extending both methods to the case of any number of unknowns 
should the necessity arise for doing so.

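80. The Newton-Raphson Method for Simultaneous Equations. Let
us consider first the case of two equations in two unknowns. Let the given
equations be

(1)    $\phi(x, y) = 0$,

(2)    $\psi(x, y) = 0$.

Now if $x_0$, $y_0$ be approximate values of a pair of roots and $h$, $k$ be corrections,
so that

    $x = x_0 + h$,
    $y = y_0 + k$,

then (1) and (2) become

(3)    $\phi(x_0 + h, y_0 + k) = 0$,

(4)    $\psi(x_0 + h, y_0 + k) = 0$.

Expanding (3) and (4) by Taylor's theorem for a function of two variables,
we have

(5)    $\phi(x_0 + h, y_0 + k) = \phi(x_0, y_0) + h\left(\dfrac{\partial \phi}{\partial x}\right)_0 + k\left(\dfrac{\partial \phi}{\partial y}\right)_0$
        + terms in higher powers of $h$ and $k$ $= 0$,

(6)    $\psi(x_0 + h, y_0 + k) = \psi(x_0, y_0) + h\left(\dfrac{\partial \psi}{\partial x}\right)_0 + k\left(\dfrac{\partial \psi}{\partial y}\right)_0$
        + terms in higher powers of $h$ and $k$ $= 0$.

Now since $h$ and $k$ are relatively small, we neglect their squares, products,
and higher powers, and then (5) and (6) become simply

(7)    $\phi(x_0, y_0) + h\left(\dfrac{\partial \phi}{\partial x}\right)_0 + k\left(\dfrac{\partial \phi}{\partial y}\right)_0 = 0$,

(8)    $\psi(x_0, y_0) + h\left(\dfrac{\partial \psi}{\partial x}\right)_0 + k\left(\dfrac{\partial \psi}{\partial y}\right)_0 = 0$.

Solving these by determinants, we find the first corrections to be

(9)    $h_1 = \dfrac{1}{D}\begin{vmatrix} -\phi(x_0, y_0) & \left(\dfrac{\partial \phi}{\partial y}\right)_0 \\ -\psi(x_0, y_0) & \left(\dfrac{\partial \psi}{\partial y}\right)_0 \end{vmatrix}$,

(10)    $k_1 = \dfrac{1}{D}\begin{vmatrix} \left(\dfrac{\partial \phi}{\partial x}\right)_0 & -\phi(x_0, y_0) \\ \left(\dfrac{\partial \psi}{\partial x}\right)_0 & -\psi(x_0, y_0) \end{vmatrix}$,

where

(11)    $D = \begin{vmatrix} \left(\dfrac{\partial \phi}{\partial x}\right)_0 & \left(\dfrac{\partial \phi}{\partial y}\right)_0 \\ \left(\dfrac{\partial \psi}{\partial x}\right)_0 & \left(\dfrac{\partial \psi}{\partial y}\right)_0 \end{vmatrix}$.

Additional corrections can be found by repeated applications of these
formulas with the improved values of $x$ and $y$ substituted at each step.
The notation $(\partial \phi/\partial x)_0$ means the value of $\partial \phi/\partial x$ when $x_0$ and $y_0$ are
substituted for $x$ and $y$. Similarly, $(\partial \phi/\partial x)_1$ means the value of $\partial \phi/\partial x$
when $x = x^{(1)}$, $y = y^{(1)}$; and so on.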

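In the case of three equations in three unknowns,

    $\phi(x, y, z) = 0$,
    $\psi(x, y, z) = 0$,
    $\chi(x, y, z) = 0$,

let $h$, $k$, $l$ denote corrections to the approximate values $x_0$, $y_0$, $z_0$,
respectively. Then proceeding exactly as in the case of two equations, we get
the three simple equations

    $\phi(x_0, y_0, z_0) + h\left(\dfrac{\partial \phi}{\partial x}\right)_0 + k\left(\dfrac{\partial \phi}{\partial y}\right)_0 + l\left(\dfrac{\partial \phi}{\partial z}\right)_0 = 0$,

    $\psi(x_0, y_0, z_0) + h\left(\dfrac{\partial \psi}{\partial x}\right)_0 + k\left(\dfrac{\partial \psi}{\partial y}\right)_0 + l\left(\dfrac{\partial \psi}{\partial z}\right)_0 = 0$,

    $\chi(x_0, y_0, z_0) + h\left(\dfrac{\partial \chi}{\partial x}\right)_0 + k\left(\dfrac{\partial \chi}{\partial y}\right)_0 + l\left(\dfrac{\partial \chi}{\partial z}\right)_0 = 0$,

for determining the first corrections $h_1$, $k_1$, $l_1$. The process may be repeated
as many times as desired.

We shall now apply this method to a pair of simultaneous equations,
one transcendental and the other algebraic.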

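Example. Compute by the Newton-Raphson method a real solution of
the equations

    $x + 3 \log_{10} x - y^2 = 0$,
    $2x^2 - xy - 5x + 1 = 0$.

Solution. On plotting the graphs of these equations on the same set
of axes, we find that they intersect at the points (1.4, -1.6) and (3.4, 2.2).
We shall compute the second set of values correct to four decimal places.
Let

    $\phi(x, y) = x + 3 \log_{10} x - y^2$,
    $\psi(x, y) = 2x^2 - xy - 5x + 1$.

Then

    $\dfrac{\partial \phi}{\partial x} = 1 + \dfrac{3M}{x}$, where $M = 0.43429$, $\qquad \dfrac{\partial \phi}{\partial y} = -2y$,

    $\dfrac{\partial \psi}{\partial x} = 4x - y - 5, \qquad \dfrac{\partial \psi}{\partial y} = -x$.

Now since $x_0 = 3.4$, $y_0 = 2.2$, we have

    $\phi(x_0, y_0) = 0.1545, \qquad \psi(x_0, y_0) = -0.72$,

    $\left(\dfrac{\partial \phi}{\partial x}\right)_0 = 1.383, \quad \left(\dfrac{\partial \phi}{\partial y}\right)_0 = -4.4, \quad \left(\dfrac{\partial \psi}{\partial x}\right)_0 = 6.4, \quad \left(\dfrac{\partial \psi}{\partial y}\right)_0 = -3.4$.

Substituting these values in (9), (10), (11), we find

    $h_1 = 0.157, \qquad k_1 = 0.085$.

Hence

    $x^{(1)} = 3.4 + 0.157 = 3.557, \qquad y^{(1)} = 2.2 + 0.085 = 2.285$.

Now substituting $x^{(1)}$ and $y^{(1)}$ for $x$ and $y$ in $\phi(x, y)$, $\psi(x, y)$, $\partial \phi/\partial x$,
etc., we get

    $\phi(x^{(1)}, y^{(1)}) = -0.011, \qquad \psi(x^{(1)}, y^{(1)}) = 0.3945$,

    $\left(\dfrac{\partial \phi}{\partial x}\right)_1 = 1.366, \quad \left(\dfrac{\partial \phi}{\partial y}\right)_1 = -4.570, \quad \left(\dfrac{\partial \psi}{\partial x}\right)_1 = 6.943, \quad \left(\dfrac{\partial \psi}{\partial y}\right)_1 = -3.557$.

Substituting these in (11), (9), (10), we get

    $h_2 = -0.0685, \qquad k_2 = -0.0229$.

Hence

    $x^{(2)} = 3.4885, \qquad y^{(2)} = 2.2621$.

Repeating the computation with these improved values of $x$ and $y$, we
find

    $h_3 = -0.0013, \qquad k_3 = -0.000561$.

Hence the third approximations are

    $x^{(3)} = 3.4872, \qquad y^{(3)} = 2.26154$,

and these are correct to the last figure.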
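
The corrections defined by formulas (9), (10), and (11) of this article can be computed mechanically. The following sketch applies them to the pair of equations in the example, starting from (3.4, 2.2); the function names and the fixed count of six steps are assumptions, and machine arithmetic is used throughout, so the intermediate values need not match the rounded hand computation figure for figure.

```python
import math

M = math.log10(math.e)  # modulus of common logarithms

def phi(x, y):
    return x + 3.0 * math.log10(x) - y * y

def psi(x, y):
    return 2.0 * x * x - x * y - 5.0 * x + 1.0

def newton_step(x, y):
    """One application of formulas (9), (10), (11)."""
    p, q = phi(x, y), psi(x, y)
    px, py = 1.0 + 3.0 * M / x, -2.0 * y   # partial derivatives of phi
    qx, qy = 4.0 * x - y - 5.0, -x         # partial derivatives of psi
    d = px * qy - py * qx                  # determinant (11)
    h = (-p * qy + q * py) / d             # correction (9)
    k = (-q * px + p * qx) / d             # correction (10)
    return x + h, y + k

x, y = 3.4, 2.2
for _ in range(6):
    x, y = newton_step(x, y)

print(round(x, 4), round(y, 4))
```

Because each step uses the current residuals, a small slip in an early step is corrected by the later ones, which is one practical advantage of the method.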

81. The Method of Iteration for Simultaneous Equations. In the case
of two equations

    $\phi(x, y) = 0$,
    $\psi(x, y) = 0$,

we first write the given equations in the forms

    $x = F_1(x, y)$,
    $y = F_2(x, y)$.

Then if $x_0$, $y_0$ be the approximate values of a pair of roots, improved values
are found by the steps indicated below:

    1st approx.:    $x^{(1)} = F_1(x_0, y_0)$,
                    $y^{(1)} = F_2(x^{(1)}, y_0)$;

    2nd approx.:    $x^{(2)} = F_1(x^{(1)}, y^{(1)})$,
                    $y^{(2)} = F_2(x^{(2)}, y^{(1)})$;

    etc.

If we are given three equations

    $\phi(x, y, z) = 0$,
    $\psi(x, y, z) = 0$,
    $\chi(x, y, z) = 0$,

we would first write them in the forms

    $x = F_1(x, y, z)$,
    $y = F_2(x, y, z)$,
    $z = F_3(x, y, z)$.

The successive steps in the computation would then be:

    1st approximation:    $x^{(1)} = F_1(x_0, y_0, z_0)$,
                          $y^{(1)} = F_2(x^{(1)}, y_0, z_0)$,
                          $z^{(1)} = F_3(x^{(1)}, y^{(1)}, z_0)$;

    2nd approximation:    $x^{(2)} = F_1(x^{(1)}, y^{(1)}, z^{(1)})$,
                          $y^{(2)} = F_2(x^{(2)}, y^{(1)}, z^{(1)})$,
                          $z^{(2)} = F_3(x^{(2)}, y^{(2)}, z^{(1)})$;

    etc.


We shall now apply the iteration process to the pair of equations which
we have already solved (for one pair of roots) by the Newton-Raphson
method:

    $\phi(x, y) = x + 3 \log_{10} x - y^2$,

    $\psi(x, y) = 2x^2 - xy - 5x + 1$.

Solution. We start with the approximate values $x_0 = 3.4$, $y_0 = 2.2$, as
indicated by the intersection of the graphs. In our next step we are
confronted with several possibilities, for the two equations can be written
in the forms $x = F_1(x, y)$, $y = F_2(x, y)$ in several ways. In the absence
of further information we start out with the simplest forms, namely

    $x = y^2 - 3 \log_{10} x$,

    $y = \dfrac{1}{x} + 2x - 5$.

Then we have

    $x^{(1)} = (2.2)^2 - 3 \log_{10} 3.4 = 3.25$,
    $y^{(1)} = \dfrac{1}{3.25} + 2(3.25) - 5 = 1.81$;
    $x^{(2)} = (1.81)^2 - 3 \log_{10} (3.25) = 1.74$,
    $y^{(2)} = \dfrac{1}{1.74} + 2(1.74) - 5 = -0.95$.

These values of $x$ and $y$ are evidently getting worse with each application
of the iteration process. We must therefore write the given equations in
some other form before attempting the iteration process again.

Without trying all possible forms we make a fresh start with the only
forms that will make the process converge, namely

    $x = \sqrt{\dfrac{x(y + 5) - 1}{2}}$,

    $y = \sqrt{x + 3 \log_{10} x}$.

Then the successive approximations are

    $x^{(1)} = \sqrt{\dfrac{3.4(2.2 + 5) - 1}{2}} = 3.426$,
    $y^{(1)} = \sqrt{3.426 + 3 \log_{10} 3.426} = 2.243$;

    $x^{(2)} = \sqrt{\dfrac{3.426(2.243 + 5) - 1}{2}} = 3.451$,
    $y^{(2)} = \sqrt{3.451 + 3 \log_{10} 3.451} = 2.2505$;

    $x^{(3)} = 3.466, \qquad y^{(3)} = 2.255$;

    $x^{(4)} = 3.475, \qquad y^{(4)} = 2.258$;

    $x^{(5)} = 3.480, \qquad y^{(5)} = 2.259$;

    $x^{(6)} = 3.483, \qquad y^{(6)} = 2.260$.


Here it is evident that the iteration process converges very slowly in this
example, for after having applied the process six times we have added
only one reliable figure to the approximate roots with which we started.
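
Both attempts of this article can be repeated by machine. The sketch below follows the scheme of Art. 81, in which each new $x$ is used at once in computing the new $y$. The two pairs of forms are entered as read from the example, the convergent pair being taken as $x = \sqrt{[x(y+5) - 1]/2}$, $y = \sqrt{x + 3\log_{10} x}$; the function name `iterate_pair` and the step counts are assumptions. Because no rounding is done between steps, the values differ slightly from the hand figures.

```python
import math

def iterate_pair(f1, f2, x, y, steps):
    """Scheme of Art. 81: new x from the old pair, then new y from the new x."""
    history = []
    for _ in range(steps):
        x = f1(x, y)
        y = f2(x, y)
        history.append((x, y))
    return history

# First attempt: the "simplest forms".
diverging = iterate_pair(
    lambda x, y: y * y - 3.0 * math.log10(x),
    lambda x, y: 1.0 / x + 2.0 * x - 5.0,
    3.4, 2.2, 2)

# Second attempt: the forms for which the process converges.
converging = iterate_pair(
    lambda x, y: math.sqrt((x * (y + 5.0) - 1.0) / 2.0),
    lambda x, y: math.sqrt(x + 3.0 * math.log10(x)),
    3.4, 2.2, 6)

print([(round(a, 2), round(b, 2)) for a, b in diverging])
print([(round(a, 3), round(b, 3)) for a, b in converging])
```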

This example brings out two important facts in connection with the
method of iteration. The first is that we must not start out blindly in 
working a problem by this method, for instead of improving the roots at 
each step we might make them decidedly worse. The second important 
fact brought out is that the iteration process should not be applied at all 
in some examples, for the convergence might be too slow, as was the case 
above. All this leads us to a consideration of the conditions under which 
the process converges. Having these conditions at hand, we can decide 
in advance as to the advisability of attempting a problem by iteration. 

82. Convergence of the Iteration Process in the Case of Several
Unknowns. To find the sufficient conditions for convergence in the case
of two equations, we write them in the forms

    $x = F_1(x, y)$,

    $y = F_2(x, y)$.

These equations are satisfied by the exact values of the pair of roots $x$, $y$.
The first approximations satisfy the equations

    $x^{(1)} = F_1(x_0, y_0)$,
    $y^{(1)} = F_2(x_0, y_0)$.

Subtracting these equations from the corresponding equations above, we
have

(1)    $x - x^{(1)} = F_1(x, y) - F_1(x_0, y_0)$,

(2)    $y - y^{(1)} = F_2(x, y) - F_2(x_0, y_0)$.

Now applying to the right-hand side of the first equation the theorem of
mean value for a function of two variables, we have

    $F_1(x, y) - F_1(x_0, y_0) = (x - x_0)\dfrac{\partial \bar{F}_1}{\partial x} + (y - y_0)\dfrac{\partial \bar{F}_1}{\partial y}$,

where

    $\dfrac{\partial \bar{F}_1}{\partial x} = \dfrac{\partial F_1[x_0 + \theta(x - x_0),\; y_0 + \theta(y - y_0)]}{\partial x}$

and

    $\dfrac{\partial \bar{F}_1}{\partial y} = \dfrac{\partial F_1[x_0 + \theta(x - x_0),\; y_0 + \theta(y - y_0)]}{\partial y}, \qquad 0 < \theta < 1$.

In a similar manner we get

    $F_2(x, y) - F_2(x_0, y_0) = (x - x_0)\dfrac{\partial \bar{F}_2}{\partial x} + (y - y_0)\dfrac{\partial \bar{F}_2}{\partial y}$.

Substituting these expressions for the right-hand members of (1) and
(2), we get

    $x - x^{(1)} = (x - x_0)\dfrac{\partial \bar{F}_1}{\partial x} + (y - y_0)\dfrac{\partial \bar{F}_1}{\partial y}$,

    $y - y^{(1)} = (x - x_0)\dfrac{\partial \bar{F}_2}{\partial x} + (y - y_0)\dfrac{\partial \bar{F}_2}{\partial y}$.

Adding these two equations and considering only the absolute values of
the several quantities, we have

(3)    $|x - x^{(1)}| + |y - y^{(1)}| \le |x - x_0| \left\{ \left|\dfrac{\partial \bar{F}_1}{\partial x}\right| + \left|\dfrac{\partial \bar{F}_2}{\partial x}\right| \right\} + |y - y_0| \left\{ \left|\dfrac{\partial \bar{F}_1}{\partial y}\right| + \left|\dfrac{\partial \bar{F}_2}{\partial y}\right| \right\}$.

Now let the maximum value of either $|\partial F_1/\partial x| + |\partial F_2/\partial x|$ or
$|\partial F_1/\partial y| + |\partial F_2/\partial y|$ be a proper fraction $m$ for all points in the region
$(x_0, x)$ and $(y_0, y)$. Then (3) becomes

    $|x - x^{(1)}| + |y - y^{(1)}| \le m\{\, |x - x_0| + |y - y_0| \,\}$.

This relation holds for the first approximation. For the succeeding
approximations we have the similar relations

    $|x - x^{(2)}| + |y - y^{(2)}| \le m\{\, |x - x^{(1)}| + |y - y^{(1)}| \,\}$,

    $|x - x^{(3)}| + |y - y^{(3)}| \le m\{\, |x - x^{(2)}| + |y - y^{(2)}| \,\}$,

    . . . . . . . . . .

    $|x - x^{(n)}| + |y - y^{(n)}| \le m\{\, |x - x^{(n-1)}| + |y - y^{(n-1)}| \,\}$.

Now multiplying together all these inequalities, member by member,
and dividing through by the common factors $\{\, |x - x^{(1)}| + |y - y^{(1)}| \,\}$,
$\{\, |x - x^{(2)}| + |y - y^{(2)}| \,\}$, etc., we get

    $|x - x^{(n)}| + |y - y^{(n)}| \le m^n \{\, |x - x_0| + |y - y_0| \,\}$.

Since $m$ is a proper fraction, it is clear that we can make the right-hand
member of this inequality as small as we please by repeating the
iteration process a sufficient number of times. This means that the errors
$|x - x^{(n)}|$ and $|y - y^{(n)}|$ can be made as small as we like.

The iteration process for two unknowns therefore converges when the
two conditions $|\partial F_1/\partial x| + |\partial F_2/\partial x| < 1$ and $|\partial F_1/\partial y| + |\partial F_2/\partial y| < 1$
hold for all points in the neighborhood of $(x_0, y_0)$. In order for the
convergence to be rapid enough to make the method advisable in any given
problem it is necessary that each of the quantities $|\partial F_1/\partial x| + |\partial F_2/\partial x|$
and $|\partial F_1/\partial y| + |\partial F_2/\partial y|$ be much less than 1.

We are now able to see why the convergence was so slow in the example
which we attempted to work by the iteration process in Art. 81. For that
example the values of the quantities named above are

    $\left|\dfrac{\partial F_1}{\partial x}\right| + \left|\dfrac{\partial F_2}{\partial x}\right| = 0.521 + 0.304 = 0.825$,

    $\left|\dfrac{\partial F_1}{\partial y}\right| + \left|\dfrac{\partial F_2}{\partial y}\right| = 0.162 + 0 = 0.162$.

The first is much too large for rapid convergence.

EXERCISES X



1. Find graphically or otherwise the approximate value of a real root
of the equation

    $2x - \log_{10} x = 7$.

2. Find the approximate value of a real root of

    $x \sinh \dfrac{10}{x} - 16 = 0$.

3. Compute to four decimal places by the interpolation method the root
found approximately in Exercise 1 above.

4. Do the same for the root found approximately in Exercise 2.

5. Find to six decimal places by the Newton-Raphson method a real
root of

    $2x - 3 \sin x - 5 = 0$.



6. Solve $x = 0.21 \sin (0.6 + x)$ by the iteration process.

7. Find to three decimal places the smallest positive root of

*• -f- 2* — 6. 

8. Find the smallest positive root of

    $x \tan x = 1.28$.

9. Find to five decimal places a root of 

slogios— ■ — 0.126. 

10. Find all the roots of 

S^-^tix — 40 — 0. 

11. Compute to six decimal places a root of 

    $6\theta - 5 \sinh \theta = 0$.

12. Find the smallest root of

X* X* X* X* 

^ ^ ^ ^ ( 6 !)* 

13. Find a real solution of

4.2x* + 8.8y*— 1.48, 

(x — 1.2)*+ (y — 0.6)» — I. 

14. Find to five decimal places a solution of 

sin X — y — 1.82, 
cosy — X — 0.86. 

15. Find all the roots of


^(x*-i)*-0.- 


16. If a table of tangents for radian arguments is at hand, find two real
roots of

4x* + 12x* tan x* — 8 — 0. 


Suggestion. Put x* — y, solve the resulting equation for y, and then 
find X. 



CHAPTER XI 


GRAEFFE’S ROOT-SQUARING METHOD FOR 
SOLVING ALGEBRAIC EQUATIONS 

83. Introduction. The methods given in the preceding chapter, except 
that of Art. 74, are applicable only for finding the real roots of numerical 
equations. It is sometimes necessary to find also the complex roots of 
algebraic equations. In studying the stability of airplanes, for example, 
it is necessary to solve linear differential equations with constant coefficients. 
The solution of such a differential equation is effected, as is well known, by 
first solving an algebraic equation whose degree is equal to the order of 
the given differential equation. The algebraic equations which arise in 
stability theory are usually of the fourth, sixth, or eighth degree. A pair
of complex roots indicates an oscillation, the real part of the root giving 
the damping factor and the imaginary part the period of oscillation. 

No short and simple method exists for finding the complex roots of 
algebraic equations of high degree. Probably the root-squaring method of
Graeffe * is the best to use in most cases. This method gives all the roots 
at once, both real and complex. 

84. Principle of the Method. The underlying principle of Graeffe's
method is this: The given equation is transformed into another whose roots
are high powers of those of the original equation. The roots of the
transformed equation are widely separated, and because of this fact are easily
found. For example, if two of the roots of the original equation are 3 and
2, the corresponding roots of the transformed equation are $3^m$ and $2^m$,
where $m$ is the power to which the roots of the given equation have been
raised. Thus, if $m = 64$, we have $3^{64} = 10^{30.536}$, $2^{64} = 10^{19.266}$. The two
roots of the given equation were of the same order of magnitude, but in
the transformed equation the larger root is more than a hundred billion
times as large as the smaller one. Stated otherwise, the ratio of the
roots in the given equation is $2/3$, but in the transformed equation it is
$10^{19.266}/10^{30.536} = 1/10^{11.27}$, or $2^{64}/3^{64} < 0.00000000001$. The smaller root
in the transformed equation is therefore negligible in comparison with the
larger one. The roots of the transformed equation are said to be separated
when the ratio of any root to the next larger is negligible in comparison
with unity.
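
The figures quoted for the roots 3 and 2 can be checked with a few lines of Python; the variable names are illustrative.

```python
import math

m = 64
log_large = m * math.log10(3)   # exponent of 10 in 3**64
log_small = m * math.log10(2)   # exponent of 10 in 2**64
ratio = (2.0 / 3.0) ** m        # ratio of the transformed roots

print(round(log_large, 3), round(log_small, 3), ratio)
```

Each squaring doubles the difference between the two logarithms, so six squarings turn a ratio of 2/3 into one far below the eleventh decimal place.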

86. The Root-Squaring Process. The transformed equation is obtained 
by repeated application of a root-squaring process. The first application 
of this process transforms the given equation into another whose roots are 
the squares of those of the original equation. This second equation is then 
transformed into a third equation whose roots are the squares of those 
of the second, and therefore the fourth powers of those of the original 
equation. The root-squaring process is continued in this manner until 
the roots of the last transformed equation are completely separated. 

We shall now explain the root-squaring process and show the method of 
applying it. 

Let the given equation be

(85.1)    $f(x) = a_0x^n + a_1x^{n-1} + a_2x^{n-2} + \cdots + a_{n-1}x + a_n = 0$.

Then if $x_1, x_2, \ldots, x_n$ be the roots of this equation, we can write it in the
equivalent form

(1)    $f(x) = a_0(x - x_1)(x - x_2)(x - x_3) \cdots (x - x_n) = 0$.

Now let us multiply (1) by the function

(2)    $(-1)^n f(-x) = (-1)^n a_0(-x - x_1)(-x - x_2) \cdots (-x - x_n)$
        $= a_0(x + x_1)(x + x_2) \cdots (x + x_n)$.

The result is

(85.2)    $(-1)^n f(-x) f(x) = a_0^2 (x^2 - x_1^2)(x^2 - x_2^2) \cdots (x^2 - x_n^2)$.

Let $x^2 = y$. Then (85.2) becomes

(3)    $\phi(y) = a_0^2 (y - x_1^2)(y - x_2^2) \cdots (y - x_n^2) = 0$.

The roots of this equation are $x_1^2, x_2^2, \ldots, x_n^2$ and are thus the squares
of the roots of the given equation (85.1). Hence to form an equation
whose roots are the squares of those of $f(x) = 0$, we merely multiply
$f(x) = 0$ by $(-1)^n f(-x)$.

This multiplication can be carried out in a simple routine manner, as
we shall now show. Let us first consider the sixth-degree equation

    $f(x) = a_0x^6 + a_1x^5 + a_2x^4 + a_3x^3 + a_4x^2 + a_5x + a_6 = 0$.

Then

    $(-1)^6 f(-x) = a_0x^6 - a_1x^5 + a_2x^4 - a_3x^3 + a_4x^2 - a_5x + a_6$.

By actual multiplication we find

(4)    $(-1)^6 f(-x) f(x) = a_0^2 x^{12} - (a_1^2 - 2a_0a_2)\,x^{10} + (a_2^2 - 2a_1a_3 + 2a_0a_4)\,x^8$
        $- (a_3^2 - 2a_2a_4 + 2a_1a_5 - 2a_0a_6)\,x^6 + (a_4^2 - 2a_3a_5 + 2a_2a_6)\,x^4$
        $- (a_5^2 - 2a_4a_6)\,x^2 + a_6^2 = 0$.

Let us consider next a seventh-degree equation,

    $f(x) = a_0x^7 + a_1x^6 + a_2x^5 + a_3x^4 + a_4x^3 + a_5x^2 + a_6x + a_7 = 0$.

    $(-1)^7 f(-x) = a_0x^7 - a_1x^6 + a_2x^5 - a_3x^4 + a_4x^3 - a_5x^2 + a_6x - a_7$.

Multiplying these equations together in the ordinary manner, as before,
we find

(5)    $(-1)^7 f(-x) f(x) = a_0^2 x^{14} - (a_1^2 - 2a_0a_2)\,x^{12} + (a_2^2 - 2a_1a_3 + 2a_0a_4)\,x^{10}$
        $- (a_3^2 - 2a_2a_4 + 2a_1a_5 - 2a_0a_6)\,x^8 + (a_4^2 - 2a_3a_5 + 2a_2a_6 - 2a_1a_7)\,x^6$
        $- (a_5^2 - 2a_4a_6 + 2a_3a_7)\,x^4 + (a_6^2 - 2a_5a_7)\,x^2 - a_7^2 = 0$.

A glance at equations (4) and (5) shows that the law of formation of
the coefficients in the squared equation is the same whether the degree of
the given equation be even or odd. In practice the multiplication is carried
out with detached coefficients as indicated below:

    a_0      a_1         a_2         a_3         a_4         ...
    ------------------------------------------------------------
    a_0^2    -a_1^2      +a_2^2      -a_3^2      +a_4^2      ...
             +2a_0a_2    -2a_1a_3    +2a_2a_4    -2a_3a_5    ...
                         +2a_0a_4    -2a_1a_5    +2a_2a_6    ...
                                     +2a_0a_6    -2a_1a_7    ...
    ------------------------------------------------------------
    b_0      b_1         b_2         b_3         b_4         ...


The coefficients in the new equation are the sums $b_0, b_1, b_2, \ldots, b_n$ of the
several columns in the scheme above. These coefficients can evidently be
written down according to the following rule:

1. The numbers in the top row are the squares of the coefficients directly
above them, with alternating signs, the second, fourth, sixth, etc. squared
numbers being negative.

2. The quantities directly under these squared numbers are the doubled
products of the coefficients equally removed from the one directly overhead,
the first being twice the product of the two coefficients adjacent to the one
overhead, the second the doubled product of the next two equally removed
coefficients, etc.

3. The signs of the doubled products are changed alternately going
along the rows and also in going down the columns, the sign of the first
doubled product in each row not being changed.
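
The rule above translates directly into a short routine. The sketch below is one possible rendering, with the function name `graeffe_step` and the list representation (coefficients $a_0, \ldots, a_n$, highest power first) chosen here for illustration.

```python
def graeffe_step(a):
    """Return the coefficients b_0..b_n of the equation whose roots are the
    squares of the roots of a_0 x^n + a_1 x^(n-1) + ... + a_n = 0."""
    n = len(a) - 1
    b = []
    for k in range(n + 1):
        total = a[k] * a[k]          # squared coefficient (rule 1)
        sign = -1
        j = 1
        # Doubled products of coefficients equally removed from a[k] (rule 2),
        # with signs alternating down the column (rule 3).
        while k - j >= 0 and k + j <= n:
            total += sign * 2 * a[k - j] * a[k + j]
            sign = -sign
            j += 1
        # Alternating sign along the row: second, fourth, ... columns negative.
        b.append((-1) ** k * total)
    return b

# (x - 1)(x - 2)(x - 3) = x^3 - 6x^2 + 11x - 6 has roots 1, 2, 3;
# the squared equation should have roots 1, 4, 9.
print(graeffe_step([1, -6, 11, -6]))
```

For the cubic shown, the result corresponds to $y^3 - 14y^2 + 49y - 36 = (y - 1)(y - 4)(y - 9)$, whose roots are indeed the squares of 1, 2, 3.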

We shall now apply Graeffe’s method to three cases of algebraic equations. 

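86. Case I. Roots all Real and Unequal. Since the relations between
the roots $x_1, x_2, \ldots, x_n$ and coefficients $a_0, a_1, \ldots, a_n$ of the general equation
of the nth degree

    $a_0x^n + a_1x^{n-1} + \cdots + a_{n-1}x + a_n = 0$

are

    $\dfrac{a_1}{a_0} = -(x_1 + x_2 + \cdots + x_n)$,

    $\dfrac{a_2}{a_0} = +(x_1x_2 + x_1x_3 + \cdots)$,

    $\dfrac{a_3}{a_0} = -(x_1x_2x_3 + x_1x_2x_4 + \cdots)$,

    . . . . . . . . . .

    $\dfrac{a_n}{a_0} = (-1)^n x_1x_2x_3 \cdots x_n$,

it follows that the roots $x_1^m, x_2^m, \ldots, x_n^m$ and coefficients $b_0, b_1, \ldots, b_n$ of
the final transformed equation

    $b_0(x^m)^n + b_1(x^m)^{n-1} + \cdots + b_{n-1}x^m + b_n = 0$

are connected by the corresponding relations

    $\dfrac{b_1}{b_0} = -(x_1^m + x_2^m + \cdots + x_n^m) = -x_1^m\left(1 + \dfrac{x_2^m}{x_1^m} + \dfrac{x_3^m}{x_1^m} + \cdots\right)$,

    $\dfrac{b_2}{b_0} = +(x_1^m x_2^m + x_1^m x_3^m + \cdots) = x_1^m x_2^m\left(1 + \dfrac{x_3^m}{x_2^m} + \cdots\right)$,

    $\dfrac{b_3}{b_0} = -(x_1^m x_2^m x_3^m + x_1^m x_2^m x_4^m + \cdots) = -x_1^m x_2^m x_3^m\left(1 + \dfrac{x_4^m}{x_3^m} + \cdots\right)$,

    . . . . . . . . . .

    $\dfrac{b_n}{b_0} = (-1)^n x_1^m x_2^m \cdots x_n^m$.

Now if the order of magnitude of the roots is

    $|x_1| > |x_2| > |x_3| > \cdots > |x_n|$,

it is evident that when the roots are sufficiently separated the ratios
$x_2^m/x_1^m$, $x_3^m/x_2^m$, etc. are negligible in comparison with unity. Hence
the relations between roots and coefficients in the final transformed
equation are

    $\dfrac{b_1}{b_0} = -x_1^m, \quad \dfrac{b_2}{b_0} = x_1^m x_2^m, \quad \dfrac{b_3}{b_0} = -x_1^m x_2^m x_3^m, \quad \ldots, \quad \dfrac{b_n}{b_0} = (-1)^n x_1^m x_2^m x_3^m \cdots x_n^m$.

Dividing each of these equations after the first by the preceding equation,
we obtain

    $\dfrac{b_2}{b_1} = -x_2^m, \quad \dfrac{b_3}{b_2} = -x_3^m, \quad \ldots, \quad \dfrac{b_n}{b_{n-1}} = -x_n^m$.

Hence from these and the equation $b_1/b_0 = -x_1^m$, we get

(86.1)    $b_0x_1^m + b_1 = 0, \quad b_1x_2^m + b_2 = 0, \quad b_2x_3^m + b_3 = 0, \quad \ldots, \quad b_{n-1}x_n^m + b_n = 0$.

The root-squaring process has thus broken up the original equation into
n simple equations from which the desired roots can be found with ease.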

The question naturally arises as to how many root-squarings are necessary
to break up the original equation into linear fragments. The answer is
that the required number of squarings depends upon (1) the ratios of
the roots of the given equation and (2) the number of significant figures
desired in the computed roots. Since the required roots, and therefore their
ratios, are not known in advance, it is not possible to determine beforehand
just how many times the root-squaring process must be repeated. This,
however, is a matter of no importance, for in practice we continue the
root-squaring process until the doubled products in the second row have no effect
on the coefficients of the next transformed equation.

Since the coefficients in the given equation are not in general all positive,
the signs of the doubled products will not occur in regular order as in the
literal equations which we used to illustrate the root-squaring process. The
possibilities of making a mistake in the signs of these products are great,
and therefore some scheme should be adopted to prevent such mistakes. As
a convenient notation for reminding us at each step as to whether or not
the sign is to be changed we shall write a "c" after each term in which the
sign is to be changed and an "n" (for no change) after each term where
the sign is not to be changed.

Furthermore, as the root-squaring process necessarily increases the
coefficients in the transformed equations until they become enormously large
numbers, we shall always write these coefficients as simple numbers
multiplied by powers of 10.

Finally, in the successive transformations of the equations by the
root-squaring process, we shall not write down the multiplier $(-1)^n f(-x)$
as was done in the scheme on page 227, but simply apply the rule stated
on page 228. We shall now compute all the roots of an equation by
Graeffe's method.

Example 1. Find all the roots of the equation

    $1.23x^5 - 2.52x^4 - 16.1x^3 + 17.3x^2 + 29.4x - 1.34 = 0$.

Solution. The preliminary work of separating the roots is given on the
following page and should be self-explanatory in view of what has been said
above. When doubled products are too small to be written down, a star (*)
is written instead.

It is evident that further squaring will simply give the squares of the
coefficients in the last line of the table, and we therefore stop with the
32d powers of the roots. Then by (86.1) we have the following five
simple equations:

    $(7.541 \times 10^{2})\,x_1^{32} - 2.346 \times 10^{22} = 0$,
    $(-2.346 \times 10^{22})\,x_2^{32} + 3.95 \times 10^{37} = 0$,
    $(3.95 \times 10^{37})\,x_3^{32} - 8.744 \times 10^{46} = 0$,
    $(-8.744 \times 10^{46})\,x_4^{32} + 2.148 \times 10^{47} = 0$,
    $(2.148 \times 10^{47})\,x_5^{32} - 1.176 \times 10^{4} = 0$.



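[Table: root-squaring computation for Example 1, carried to the 32d powers of the roots. Its last line gives the coefficients $7.541 \times 10^{2}$, $-2.346 \times 10^{22}$, $+3.95 \times 10^{37}$, $-8.744 \times 10^{46}$, $+2.148 \times 10^{47}$, $-1.176 \times 10^{4}$.]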


Solving these by logarithms, we have

    $\log x_1 = \dfrac{20 + \log 2.346 - \log 7.541}{32} = 0.60916$,

    $x_1 = 4.066$.

In a similar manner we find

    $x_2 = 2.991, \quad x_3 = 1.959, \quad x_4 = 1.0285, \quad x_5 = 0.04447$.

The signs of these roots are yet to be determined. To do this we first
apply Descartes's rule of signs and find that there can not be more than
three positive roots nor more than two negative roots. Then we substitute
in the given equation the approximate values $\pm 4$, $\pm 3$, $\pm 2$, $\pm 1$,
$\pm 0.04$ and see whether the positive or negative value comes nearer to
satisfying the equation. In this manner we find that the roots are

    $x_1 = 4.066$,
    $x_2 = -2.991$,
    $x_3 = 1.959$,
    $x_4 = -1.0285$,
    $x_5 = 0.0445$.

The sum of these roots is 2.050, whereas it should be $2.52/1.23 = 2.049$.
The agreement is therefore as close as could be expected.
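
The whole computation of Example 1 can be repeated by machine as a check on the hand work. The sketch below is illustrative: the function names are assumptions, the coefficients are entered as read from the example, five squarings are applied to reach the 32d powers, and the sign of each root is chosen by substituting both signs in the given equation. Full floating-point precision is kept, so the last figures may differ slightly from the four-figure hand computation.

```python
def graeffe_step(a):
    """One root-squaring; coefficients are listed highest power first."""
    n = len(a) - 1
    b = []
    for k in range(n + 1):
        total = a[k] * a[k]
        sign, j = -1, 1
        while k - j >= 0 and k + j <= n:
            total += sign * 2 * a[k - j] * a[k + j]
            sign, j = -sign, j + 1
        b.append((-1) ** k * total)
    return b

def evaluate(a, x):
    """Horner evaluation of the polynomial with coefficients a at x."""
    value = 0.0
    for c in a:
        value = value * x + c
    return value

a = [1.23, -2.52, -16.1, 17.3, 29.4, -1.34]

b, m = list(a), 1
for _ in range(5):          # five squarings give the 32d powers
    b = graeffe_step(b)
    m *= 2

# Equations (86.1): |x_k|**m = |b_k / b_(k-1)|.
magnitudes = [abs(b[k + 1] / b[k]) ** (1.0 / m) for k in range(len(a) - 1)]

# Choose the sign that comes nearer to satisfying the given equation.
roots = [r if abs(evaluate(a, r)) < abs(evaluate(a, -r)) else -r
         for r in magnitudes]

print([round(r, 4) for r in roots], round(sum(roots), 4))
```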

87. A Check on the Coefficients in the Root-Squared Equation. All 
roots found by Graeffe’s method should be carefully checked by some 
means or other. The coefficients in the root-squared equations can be 
checked by a process due to H. Rainbow.* The root-squared equation
(86.2) can be written in the form

$$(-1)^n f(x) f(-x) = F(x^2) = A_0 (x^2)^n + A_1 (x^2)^{n-1} + A_2 (x^2)^{n-2} + \cdots + A_{n-1} x^2 + A_n. \tag{87.1}$$

To derive Rainbow's check formula we put $x = 1$ and $x = -1$ in (86.1)
and (87.1). Then we have

$$
\begin{aligned}
f(1) &= a_0 + a_1 + a_2 + \cdots + a_{n-1} + a_n = \sum_{k=0}^{n} a_k,\\
f(-1) &= a_0(-1)^n + a_1(-1)^{n-1} + a_2(-1)^{n-2} + \cdots + a_{n-1}(-1) + a_n = \sum_{k=0}^{n} (-1)^{n-k} a_k,\\
F(1) &= A_0 + A_1 + A_2 + \cdots + A_{n-1} + A_n = \sum_{k=0}^{n} A_k.
\end{aligned}
$$

* H. Rainbow, mathematician at the Research Laboratory of the Shell Oil Company, 
Houston, Texas. 



On substituting these in (87.1), we get

$$\sum_{k=0}^{n} A_k = (-1)^n \Big[\sum_{k=0}^{n} a_k\Big]\Big[\sum_{k=0}^{n} (-1)^{n-k} a_k\Big], \tag{87.2}$$

which is Rainbow's check formula.

This formula is applied as follows:

(a) Find the algebraic sum of the coefficients in any equation, either
the given equation or an equation whose roots are powers of the roots of
the given equation. This gives the factor $\sum_{0}^{n} a_k$.

(b) Change the signs of the coefficients of the odd powers of x in the
equation mentioned in (a) and then find the algebraic sum of all the
coefficients. This gives the factor $\sum_{0}^{n} (-1)^{n-k} a_k$.

(c) Find the product of the sums found in (a) and (b) and then
multiply this product by $(-1)^n$. The result should agree very closely
with the algebraic sum ($\sum_{0}^{n} A_k$) of the coefficients in the root-squared
equation next below the given equation.

Example 1. Let us apply Rainbow's check to the first root-squared
equation in the table on page 231.

Here

$$n = 5, \quad \sum_{0}^{n} a_k = 27.97, \quad \sum_{0}^{n} (-1)^{n-k} a_k = -1.09.$$

Hence

$$(-1)^5 (27.97)(-1.09) = 30.4873.$$

The sum of the coefficients in the root-squared equation is

$$\sum A_k = 30.257.$$

The lack of agreement is due to the fact that the coefficients in the root-
squared equation are rounded numbers multiplied by powers of 10. If
the root-squaring process is carried through without rounding any numbers,
we find $\sum A_k = 30.4873$, as the formula requires.
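
The unrounded check can be reproduced mechanically. The sketch below, in Python, assumes the equation of Example 1 of Art. 86 with the coefficients 1.23, -2.52, -16.1, 17.3, 29.4, -1.34; the helper name `graeffe_step` is hypothetical. It forms the first root-squared equation without rounding and compares the sum of its coefficients with the right-hand member of Rainbow's formula.

```python
def graeffe_step(a):
    """One root-squaring step (hypothetical helper): coefficients of (-1)^n f(x) f(-x) in powers of x^2."""
    n = len(a) - 1
    b = []
    for k in range(n + 1):
        s = a[k] * a[k]
        sign, j = -1, 1
        while k - j >= 0 and k + j <= n:
            s += sign * 2 * a[k - j] * a[k + j]
            sign, j = -sign, j + 1
        b.append(s if k % 2 == 0 else -s)
    return b


a = [1.23, -2.52, -16.1, 17.3, 29.4, -1.34]
n = len(a) - 1
A = graeffe_step(a)                                       # unrounded root-squared equation

sum_a = sum(a)                                            # step (a): f(1)
alt_sum = sum((-1) ** (n - k) * a[k] for k in range(n + 1))  # step (b): f(-1)
check = (-1) ** n * sum_a * alt_sum                       # step (c)

print(round(sum_a, 2), round(alt_sum, 2))
print(round(check, 4), round(sum(A), 4))
```

Because nothing is rounded here, the two printed sums agree to within floating-point error; with table entries rounded to a few figures the agreement is only approximate, as the example shows.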

Example 2. Applying Rainbow's formula to the coefficients in the 4th-
power equation of p. 231, we have

$$\sum a_k = 80{,}000, \quad \sum (-1)^{n-k} a_k = -1{,}696{,}000.$$

Then

$$(-1)^5 \Big[\sum a_k\Big]\Big[\sum (-1)^{n-k} a_k\Big] = 136{,}000{,}000{,}000.$$

Now finding the sum of the coefficients in the 8th-power equation, we get
$\sum A_k = 136{,}000{,}000{,}000$. Here the agreement seems perfect, but such is not
quite the case. The zeros occupy places of unknown digits which were 
cut off in rounding the squares and products. The only reliable significant 
figures in these sums came from the largest coefficients; that is, the 
coefficients containing the highest powers of 10. 

Although the Rainbow formula cannot give an accurate check in a 
numerical example, because the coefficients in high-powered equations are 
so large that they must be expressed by a few digits multiplied by powers 
of 10, it will nevertheless detect any large error in the process of squaring 
the roots. 

88. Case II. Complex Roots. When some of the roots of an algebraic 
equation are complex, the equation can not be expressed as a product of 
linear factors with real coefficients. Such an equation can, however, always 
be expressed as a product of real linear and real quadratic factors, each 
quadratic factor corresponding to a pair of complex roots. The root- 
squaring process can therefore never break up such an equation into linear 
fragments as in the case when all the roots are real and unequal. 

When an equation has complex roots, the root-squaring process always 
breaks it up into linear and quadratic fragments. The real roots, if any, 
are found from the linear fragments as in Case I, while the complex roots 
are found from the quadratic fragments. 

In transforming an equation by the root-squaring process the presence 
of complex roots is revealed in two ways: (1) the doubled products do 
not all disappear from the first row and (2) the signs of some of the
coefficients fluctuate as the transformations continue. The reason for these
peculiarities can be seen by considering a typical example.

88a). Detection of Complex Roots. Let us consider an equation having
two distinct real roots and two pairs of complex roots. Let these roots be
$x_1$, $r_1 e^{i\theta_1}$, $r_1 e^{-i\theta_1}$, $x_2$, $r_2 e^{i\theta_2}$, $r_2 e^{-i\theta_2}$; and let the order of their magnitude be

$$|x_1| > r_1 > |x_2| > r_2.$$

Then the equation having these roots is

$$(x - x_1)(x - r_1 e^{i\theta_1})(x - r_1 e^{-i\theta_1})(x - x_2)(x - r_2 e^{i\theta_2})(x - r_2 e^{-i\theta_2}) = 0. \tag{1}$$

The equation whose roots are the mth powers of the roots of this equation
is therefore

$$(y - x_1^m)(y - r_1^m e^{im\theta_1})(y - r_1^m e^{-im\theta_1})(y - x_2^m)(y - r_2^m e^{im\theta_2})(y - r_2^m e^{-im\theta_2}) = 0, \tag{2}$$

where $y = x^m$.

On performing the indicated multiplications in (2), then taking out the
leading factors and neglecting the ratios

$$\frac{r_1^m}{x_1^m}, \quad \frac{x_2^m}{x_1^m}, \quad \frac{r_2^m}{x_1^m}, \quad \frac{x_2^m}{r_1^m}, \quad \frac{r_2^m}{r_1^m}, \quad \frac{r_2^m}{x_2^m},$$

since each of these is negligible in comparison with unity, we finally get

$$
\begin{aligned}
y^6 - x_1^m y^5 + 2x_1^m r_1^m \cos m\theta_1\, y^4 - x_1^m r_1^{2m} y^3 + x_1^m r_1^{2m} x_2^m y^2 &\\
{} - 2x_1^m r_1^{2m} x_2^m r_2^m \cos m\theta_2\, y + x_1^m r_1^{2m} x_2^m r_2^{2m} &= 0.
\end{aligned} \tag{3}
$$

The roots of the original equation have now been separated as much as
they can ever be (since in deriving (3) we neglected such ratios as $r_1^m/x_1^m$,
etc.), and the given equation has been broken up into the linear and
quadratic fragments

$$
\begin{aligned}
y^6 - x_1^m y^5 &= 0,\\
-x_1^m y^5 + 2x_1^m r_1^m \cos m\theta_1\, y^4 - x_1^m r_1^{2m} y^3 &= 0,\\
-x_1^m r_1^{2m} y^3 + x_1^m r_1^{2m} x_2^m y^2 &= 0,\\
x_1^m r_1^{2m} x_2^m y^2 - 2x_1^m r_1^{2m} x_2^m r_2^m \cos m\theta_2\, y + x_1^m r_1^{2m} x_2^m r_2^{2m} &= 0,
\end{aligned}
$$

from which we can obtain the original roots with which we started.

Suppose, now, that we apply the root-squaring process to (3) once more, 
as shown below: 



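$$
\begin{array}{l|c|c|c|c}
 & y^6 & y^5 & y^4 & y^3 \\ \hline
m\text{th p.} & 1 & -x_1^m & 2x_1^m r_1^m \cos m\theta_1 & -x_1^m r_1^{2m} \\ \hline
 & 1 & -x_1^{2m} & +4x_1^{2m} r_1^{2m} \cos^2 m\theta_1 & -x_1^{2m} r_1^{4m} \\
 & & +4x_1^m r_1^m \cos m\theta_1 & -2x_1^{2m} r_1^{2m} & +4x_1^{2m} r_1^{3m} x_2^m \cos m\theta_1 \\
 & & & +2x_1^m r_1^{2m} x_2^m & -4x_1^{2m} r_1^{2m} x_2^m r_2^m \cos m\theta_2 \\
 & & & & +2x_1^m r_1^{2m} x_2^m r_2^{2m} \\ \hline
2m\text{th p.} & 1 & -x_1^{2m} & +4x_1^{2m} r_1^{2m} \cos^2 m\theta_1 - 2x_1^{2m} r_1^{2m} & -x_1^{2m} r_1^{4m}
\end{array}
$$

$$
\begin{array}{l|c|c|c}
 & y^2 & y & y^0 \\ \hline
m\text{th p.} & x_1^m r_1^{2m} x_2^m & -2x_1^m r_1^{2m} x_2^m r_2^m \cos m\theta_2 & x_1^m r_1^{2m} x_2^m r_2^{2m} \\ \hline
 & +x_1^{2m} r_1^{4m} x_2^{2m} & -4x_1^{2m} r_1^{4m} x_2^{2m} r_2^{2m} \cos^2 m\theta_2 & +x_1^{2m} r_1^{4m} x_2^{2m} r_2^{4m} \\
 & -4x_1^{2m} r_1^{4m} x_2^m r_2^m \cos m\theta_2 & +2x_1^{2m} r_1^{4m} x_2^{2m} r_2^{2m} & \\
 & +4x_1^{2m} r_1^{3m} x_2^m r_2^{2m} \cos m\theta_1 & & \\ \hline
2m\text{th p.} & +x_1^{2m} r_1^{4m} x_2^{2m} & -4x_1^{2m} r_1^{4m} x_2^{2m} r_2^{2m} \cos^2 m\theta_2 + 2x_1^{2m} r_1^{4m} x_2^{2m} r_2^{2m} & +x_1^{2m} r_1^{4m} x_2^{2m} r_2^{4m}
\end{array}
$$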

It is readily seen on dividing the doubled products in each column by
the squared term at the top that all these products are negligible except
two in the first row. Hence the sums of the several columns are as given
above. This result shows why the doubled products in the first row do
not all disappear when the complex roots are present.

Furthermore, since $2\cos^2\theta - 1 = \cos 2\theta$, we can write the coefficients of
$y^4$ and $y$ in the forms $2x_1^{2m} r_1^{2m} \cos 2m\theta_1$ and $-2x_1^{2m} r_1^{4m} x_2^{2m} r_2^{2m} \cos 2m\theta_2$,
respectively. Hence the coefficients in the last transformed equation are
simply

$$
\begin{aligned}
2m\text{th p.}\quad 1 \;-\; x_1^{2m} \;+\; 2x_1^{2m} r_1^{2m} \cos 2m\theta_1 \;-\; x_1^{2m} r_1^{4m} \;+\; x_1^{2m} r_1^{4m} x_2^{2m} &\\
{} - 2x_1^{2m} r_1^{4m} x_2^{2m} r_2^{2m} \cos 2m\theta_2 \;+\; x_1^{2m} r_1^{4m} x_2^{2m} r_2^{4m}. &
\end{aligned} \tag{5}
$$

On comparing this last equation with the one for the mth powers of 
the roots we see at once that each application of the root-squaring process 
doubles the amplitudes of the complex roots. Hence the cosines of these 
amplitudes must frequently change signs as the amplitudes are continually 
doubled. This explains the fluctuation in the signs of some of the coeffi- 
cients when complex roots are present. 
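
The fluctuation of signs is easy to observe on a small case. The sketch below, in Python, uses an equation chosen here purely for illustration (it is not from the text): $x^3 - 5x^2 + 8x - 6 = 0$, whose roots are 3 and $1 \pm i$. The helper name `graeffe_step` is hypothetical. Exact integer arithmetic is used, so no rounding enters.

```python
def graeffe_step(a):
    """One root-squaring step (hypothetical helper), exact for integer coefficients."""
    n = len(a) - 1
    b = []
    for k in range(n + 1):
        s = a[k] * a[k]
        sign, j = -1, 1
        while k - j >= 0 and k + j <= n:
            s += sign * 2 * a[k - j] * a[k + j]
            sign, j = -sign, j + 1
        b.append(s if k % 2 == 0 else -s)
    return b


# (x - 3)(x^2 - 2x + 2): one real root and one complex pair of amplitude 45 degrees.
history = [[1, -5, 8, -6]]
for _ in range(3):
    history.append(graeffe_step(history[-1]))

for power, row in zip((1, 2, 4, 8), history):
    print(power, row)
```

The third coefficient, which carries the cosine of the doubled amplitude, runs through the values 8, 4, -632, 210208: its sign changes and changes back, and its doubled product never becomes negligible, while the other columns settle down to pure squares.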

After the original equation has been broken up into linear and quadratic 
fragments by the root-squaring process, we can find the complex roots by 
solving the resulting quadratic equations for $x^m$ and then extracting the
mth root of the results by means of De Moivre’s theorem. But by pro- 
ceeding in this manner we would have ambiguities of sign in the computed 
roots, and such ambiguities are not easily removed. To obtain the complex 
roots without ambiguity as to signs we derive some further relations 
between roots and coefficients. 

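88b). Relations between the Coefficients of an Algebraic Equation and
the Reciprocals of Its Roots. In the general equation

$$a_0 x^n + a_1 x^{n-1} + a_2 x^{n-2} + \cdots + a_{n-1} x + a_n = 0$$

let us put $x = 1/y$. The result, after clearing of fractions, is

$$a_n y^n + a_{n-1} y^{n-1} + \cdots + a_3 y^3 + a_2 y^2 + a_1 y + a_0 = 0.$$

Hence from the well-known relations between roots and coefficients (p. 228)
we have

$$
\begin{aligned}
y_1 + y_2 + \cdots + y_n &= -\frac{a_{n-1}}{a_n},\\
y_1 y_2 + y_1 y_3 + \cdots + y_j y_k + \cdots &= \frac{a_{n-2}}{a_n},\\
&\;\;\vdots\\
y_1 y_2 \cdots y_n &= (-1)^n \frac{a_0}{a_n},
\end{aligned}
$$

or, since $y = 1/x$,

$$
\begin{aligned}
\frac{1}{x_1} + \frac{1}{x_2} + \cdots + \frac{1}{x_n} &= -\frac{a_{n-1}}{a_n},\\
\frac{1}{x_1 x_2} + \frac{1}{x_1 x_3} + \cdots + \frac{1}{x_{n-1} x_n} &= \frac{a_{n-2}}{a_n},\\
&\;\;\vdots\\
\frac{1}{x_1 x_2 x_3 \cdots x_n} &= (-1)^n \frac{a_0}{a_n}.
\end{aligned} \tag{6}
$$

These relations between the coefficients and reciprocals of the roots will
help us to avoid ambiguities of sign in the computation of complex roots.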
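
These relations can be verified on a small case. The sketch below, in Python, assumes an illustrative cubic that is not taken from the text, $x^3 - 7x^2 + 14x - 8 = 0$ with roots 1, 2, 4, and uses exact fractions so that the equalities hold without rounding.

```python
from fractions import Fraction
from itertools import combinations
from math import prod

# Illustrative equation (an assumption, not from the text): (x - 1)(x - 2)(x - 4) = 0.
a = [1, -7, 14, -8]
n = len(a) - 1
roots = [Fraction(1), Fraction(2), Fraction(4)]

recips = [1 / r for r in roots]
sum_recips = sum(recips)                                       # sum of 1/x_i
sum_pairs = sum(p * q for p, q in combinations(recips, 2))     # sum of 1/(x_j x_k)
prod_recips = prod(recips)                                     # 1/(x_1 x_2 x_3)

print(sum_recips, Fraction(-a[n - 1], a[n]))
print(sum_pairs, Fraction(a[n - 2], a[n]))
print(prod_recips, (-1) ** n * Fraction(a[0], a[n]))
```

Each printed pair shows the left and right members of one of the relations side by side.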

Example 2. Find all the roots of the equation

$$x^7 - 2x^5 - 3x^3 + 4x^2 - 5x + 6 = 0.$$

Solution. The preliminary work of separating the roots is shown on
pages 238-239 and should be self-explanatory.

It is evident from the last application of the root-squaring process that
another application would effect no further separation of the roots. Hence
we stop with the 256th powers of the roots.

The given equation has now been broken up into three linear and two 
quadratic fragments. We first compute the real roots from the linear 
fragments. 

[Table, pages 238-239: root-squaring of the equation of Example 2, carried to the 256th powers of the roots.]

For the first real root we have by (86.1)

$$x_1^{256} = 9.084 \times 10^{74},$$

from which we find by logarithms

$$x_1 = 1.9625.$$

The second real root is found from

$$(-9.084 \times 10^{74})\,x_2^{256} + 6.472 \times 10^{122} = 0.$$

Solving this by logarithms, we find

$$x_2 = 1.5379.$$

The next two roots are complex, but the fifth, a real root, is found from
the equation

$$(3.879 \times 10^{179})\,x_3^{256} - 9.852 \times 10^{190} = 0,$$

from which

$$x_3 = 1.1080.$$

To determine the signs of these roots we first apply Descartes's rule of
signs to the original equation and find that there can not be more than
one negative root. The other two real roots must therefore be positive.
On substituting in the original equation the rough values ±2, we find
that -2 nearly satisfies the equation. Hence $x_1 = -1.9625$. The three
real roots are therefore

$$x_1 = -1.9625, \quad x_2 = 1.5379, \quad x_3 = 1.1080.$$
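
The choice of signs can be confirmed by direct substitution. The following Python sketch assumes the equation of this example in the form $x^7 - 2x^5 - 3x^3 + 4x^2 - 5x + 6 = 0$ and takes the three real roots as -1.9625, 1.5379, and 1.1080; the function name `horner` is a hypothetical helper. It evaluates the polynomial by nested multiplication at each root and, for contrast, at +1.9625.

```python
def horner(coeffs, x):
    """Evaluate a polynomial, coefficients in descending powers, by nested multiplication."""
    value = 0.0
    for c in coeffs:
        value = value * x + c
    return value


f = [1, 0, -2, 0, -3, 4, -5, 6]          # x^7 - 2x^5 - 3x^3 + 4x^2 - 5x + 6

residuals = {x: horner(f, x) for x in (-1.9625, 1.5379, 1.1080)}
wrong_sign = horner(f, 1.9625)           # the magnitude 1.9625 with the positive sign

for x, r in residuals.items():
    print(x, r)
print(1.9625, wrong_sign)
```

The residuals at the three roots are small compared with the coefficients, whereas the value at +1.9625 is far from zero, which is the substitution test described above.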


The modulus of the first pair of complex roots is found from the
quadratic equation

$$(6.472 \times 10^{122})y^2 + (2.093 \times 10^{151})y + 3.879 \times 10^{179} = 0, \tag{a}$$

where $y = x^{256}$. Let $r_1$ denote this modulus. We find $r_1$ by means of a
simple theorem connecting the coefficients of a quadratic equation with the
modulus of its complex roots.

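Let the quadratic equation

$$x^2 + bx + c = 0 \tag{b}$$

have the complex roots $re^{i\theta}$ and $re^{-i\theta}$. Then

$$
\begin{aligned}
x^2 + bx + c &= (x - re^{i\theta})(x - re^{-i\theta})\\
&= x^2 - r(e^{i\theta} + e^{-i\theta})x + r^2\\
&= x^2 - (2r\cos\theta)x + r^2.
\end{aligned}
$$

Hence $c = r^2$, $-b = 2r\cos\theta$; that is, the absolute term in the quadratic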
(b) is equal to the square of the modulus of its complex roots. 
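
A quick numerical illustration of this theorem, as a Python sketch: the quadratic $x^2 - 2x + 2 = 0$ is an assumed example (it is not from the text) with roots $1 \pm i$. The modulus is taken from the absolute term and the real part from the middle coefficient, and both are compared with a root obtained from the quadratic formula.

```python
import cmath
import math

b, c = -2.0, 2.0                     # assumed example: x^2 - 2x + 2 = 0, roots 1 + i and 1 - i

r = math.sqrt(c)                     # modulus from the absolute term, c = r^2
cos_theta = -b / (2.0 * r)           # from -b = 2 r cos(theta)

root = (-b + cmath.sqrt(b * b - 4.0 * c)) / 2.0   # direct solution, for comparison

print(r, abs(root))
print(r * cos_theta, root.real)
```

The modulus found from the absolute term alone agrees with that of the directly computed root, which is why only the first and last coefficients of a quadratic fragment are needed to obtain the modulus.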

Let $R_1$ denote the modulus of the complex roots of (a). Then on
dividing the equation through by $6.472 \times 10^{122}$ and applying the theorem
just stated, we get

$$R_1^2 = \frac{3.879 \times 10^{57}}{6.472}.$$

Since, however, $R_1 = r_1^{256}$, we have

$$r_1^{512} = \frac{3.879 \times 10^{57}}{6.472}.$$

Solving this by logarithms, we find

$$r_1 = 1.2909.$$

The modulus of the second pair of complex roots is found in like manner
from the quadratic

$$(-9.852 \times 10^{190})y^2 + (0.163 \times 10^{196})y - 1.618 \times 10^{199} = 0,$$

or

$$y^2 - \frac{0.163 \times 10^{6}}{9.852}\,y + \frac{1.618 \times 10^{9}}{9.852} = 0. \tag{c}$$

Denoting this modulus by $r_2$ and that of (c) by $R_2$, we have

$$R_2^2 = \frac{1.618 \times 10^{9}}{9.852}, \qquad r_2^{512} = \frac{1.618 \times 10^{9}}{9.852},$$

from which

$$r_2 = 1.0376.$$


Now let the two pairs of complex roots be denoted by

$$u_1 + iv_1,\; u_1 - iv_1 \quad\text{and}\quad u_2 + iv_2,\; u_2 - iv_2,$$

respectively. Then since the sum of the roots of the given equation is 0,
we have

$$x_1 + x_2 + 2u_1 + x_3 + 2u_2 = 0,$$

or

$$u_1 + u_2 = -0.3417. \tag{d}$$



We next apply the theorem connecting the sum of the reciprocals of
the roots with the coefficients of the given equation, namely

$$\frac{1}{x_1} + \frac{1}{x_2} + \frac{1}{u_1 + iv_1} + \frac{1}{u_1 - iv_1} + \frac{1}{x_3} + \frac{1}{u_2 + iv_2} + \frac{1}{u_2 - iv_2} = \frac{5}{6}.$$

Rationalizing the denominators of the complex terms and putting $u_1^2 + v_1^2 = r_1^2$,
$u_2^2 + v_2^2 = r_2^2$, we get

$$\frac{1}{x_1} + \frac{1}{x_2} + \frac{2u_1}{r_1^2} + \frac{1}{x_3} + \frac{2u_2}{r_2^2} = \frac{5}{6}.$$

Now substituting in this equation the numerical values

$$\frac{1}{x_1} = -0.508386, \quad \frac{1}{x_2} = 0.650234, \quad \frac{1}{x_3} = 0.902527, \quad \frac{1}{r_1^2} = 0.60010, \quad \frac{1}{r_2^2} = 0.92875$$

and dividing through by 2, we obtain

$$0.60010\,u_1 + 0.92875\,u_2 = -0.10552. \tag{e}$$

Solving (d) and (e) simultaneously, we find

$$u_1 = -0.6445, \quad u_2 = 0.3028.$$

$v_1$ and $v_2$ are found from the formulas $v_1 = \sqrt{r_1^2 - u_1^2} = \sqrt{(r_1 + u_1)(r_1 - u_1)}$
and $v_2 = \sqrt{r_2^2 - u_2^2} = \sqrt{(r_2 + u_2)(r_2 - u_2)}$ to be

$$v_1 = 1.1185, \quad v_2 = 0.9924.$$

Hence the two pairs of complex roots are

$$-0.6445 \pm 1.118i \quad\text{and}\quad 0.3028 \pm 0.992i.$$

We have thus obtained the complex roots without any ambiguity of signs.
The computed roots in this example can be checked by substituting the
values of the real roots and moduli in the known relation

$$x_1 x_2 r_1^2 x_3 r_2^2 = -6,$$

or, taking the factors numerically,

$$\log(x_1 x_2 r_1^2 x_3 r_2^2) = \log 6.$$

These logarithms are found to be

$$0.77815 = 0.77815.$$


The agreement is thus as close as could be expected. 





Remark. If an equation contains more than two pairs of complex roots, 
the moduli of the roots can be found from the quadratic fragments as in 
the example above. Then the real parts $u_1, u_2, u_3, \cdots$ can be found by
making further use of the relations connecting the roots and the reciprocals 
of the roots with the coefficients of the original equation. 

In some equations of high degree it might be advantageous, after finding 
the real roots, to depress the original equation by taking out the real roots 
and leaving only the complex roots. This is conveniently done by synthetic 
division. The relations between the roots and coefficients of the depressed 
equation should then be used. 

89. Case III. Roots Real and Numerically Equal. If two roots of an
equation are numerically equal, the root-squaring process can never break
up the equation into linear fragments. One of the doubled products will
always remain in the first row. This product will be just half the squared
term above it, as can be seen by considering an equation of the third degree.

Let the roots of

$$x^3 + a_1 x^2 + a_2 x + a_3 = 0 \tag{1}$$

be $x_1$, $x_2$, $x_3$. Then the equation whose roots are the mth powers of those
of (1) is

$$(y - x_1^m)(y - x_2^m)(y - x_3^m) = 0, \quad\text{where } y = x^m,$$

or

$$y^3 - (x_1^m + x_2^m + x_3^m)y^2 + (x_1^m x_2^m + x_1^m x_3^m + x_2^m x_3^m)y - x_1^m x_2^m x_3^m = 0,$$

$$y^3 - x_1^m\left(1 + \frac{x_2^m}{x_1^m} + \frac{x_3^m}{x_1^m}\right)y^2 + x_1^m x_2^m\left(1 + \frac{x_3^m}{x_2^m} + \frac{x_3^m}{x_1^m}\right)y - x_1^m x_2^m x_3^m = 0. \tag{2}$$

Now let $x_2 = x_3$ and let $|x_1| > |x_2|$. Then for sufficiently large values
of m the ratio $x_2^m/x_1^m$ is negligible in comparison with unity, and (2)
reduces to

$$y^3 - x_1^m y^2 + 2x_1^m x_2^m y - x_1^m x_2^{2m} = 0. \tag{3}$$

The roots of the given equation have now been separated as much as
they can ever be, but we shall apply the root-squaring process to (3) to
see what happens. Using only the coefficients, we have

$$
\begin{array}{l|c|c|c|c}
m\text{th p.} & 1 & -x_1^m & 2x_1^m x_2^m & -x_1^m x_2^{2m} \\ \hline
 & 1 & -x_1^{2m} & +4x_1^{2m} x_2^{2m} & -x_1^{2m} x_2^{4m} \\
 & & +4x_1^m x_2^m & -2x_1^{2m} x_2^{2m} & \\ \hline
2m\text{th p.} & 1 & -x_1^{2m} & +2x_1^{2m} x_2^{2m} & -x_1^{2m} x_2^{4m}
\end{array}
$$

It will be noticed that the first doubled product is negligible in com-
parison with the squared term above it, whereas the second is of the same
order of magnitude as the squared term above and just half as large.
Furthermore, in the equation for the 2mth powers of the roots all the coeffi-
cients except one are the squares of those in the preceding equation. This
remaining one is only half the square of the corresponding coefficient in the
preceding equation. These peculiarities enable us to detect equal real roots
immediately. We shall now show how to compute such roots.

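Example 3. Solve the equation

$$5x^3 + 2x^2 - 15x - 6 = 0.$$

Solution.

$$
\begin{array}{l|r|r|r|r}
\text{Given equa.} & 5 & 2 & -15 & -6 \\ \hline
 & 25 & -4 & 225 & -36 \\
 & & -150 & +24 & \\ \hline
\text{2d p.} & 25 & -1.54 \cdot 10^{2} & +2.49 \cdot 10^{2} & -36 \\ \hline
 & 6.25 \cdot 10^{2} & -2.3716 \cdot 10^{4} & +6.2001 \cdot 10^{4} & -1.296 \cdot 10^{3} \\
 & & +1.2450 & -1.1088 & \\ \hline
\text{4th p.} & 6.25 \cdot 10^{2} & -1.1266 \cdot 10^{4} & +5.0993 \cdot 10^{4} & -1.296 \cdot 10^{3} \\ \hline
 & 3.9062 \cdot 10^{5} & -1.269 \cdot 10^{8} & +2.600 \cdot 10^{9} & -1.680 \cdot 10^{6} \\
 & & +0.637 & -0.029 & \\ \hline
\text{8th p.} & 3.906 \cdot 10^{5} & -0.632 \cdot 10^{8} & +2.571 \cdot 10^{9} & -1.680 \cdot 10^{6} \\ \hline
 & 1.526 \cdot 10^{11} & -3.994 \cdot 10^{15} & +6.610 \cdot 10^{18} & -2.822 \cdot 10^{12} \\
 & & +2.008 & * & \\ \hline
\text{16th p.} & 1.526 \cdot 10^{11} & -1.986 \cdot 10^{15} & +6.610 \cdot 10^{18} & -2.822 \cdot 10^{12}
\end{array}
$$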

The given equation has now been broken up into the simple frag-
ment $(6.610 \cdot 10^{18})\,x_3^{16} - 2.822 \cdot 10^{12} = 0$ and the quadratic fragment
$1.526 \cdot 10^{11}\,x_1^{32} - 1.986 \cdot 10^{15}\,x_1^{16} + 6.610 \cdot 10^{18} = 0$. Solving the simple
fragment by logarithms, we find

$$x_3 = 0.3999.$$

To find the roots of the quadratic fragment we write the equation in
the form

$$x_1^{32} - \frac{1.986 \times 10^{4}}{1.526}\,x_1^{16} + \frac{6.61 \times 10^{7}}{1.526} = 0.$$

Since the roots are known to be equal and since their product is equal to
the absolute term of the quadratic, we have

$$x_1^{32} = \frac{6.61 \times 10^{7}}{1.526}.$$

Solving by logarithms, we get

$$x_1 = 1.732.$$

We check this result by putting the sum of the roots equal to the coeffi-
cient of $x_1^{16}$ with its sign changed. Since the roots are equal, we have

$$2x_1^{16} = \frac{1.986 \times 10^{4}}{1.526},$$

from which

$$x_1 = 1.731.$$


We shall next determine the signs of these roots. By Descartes’s rule 
there can not be more than one positive root nor more than two negative 
roots. Hence we try ± 0.4 and find that — 0.4 satisfies the given equation. 
The other two roots are therefore ±1.732.
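
The behavior described in this article can be observed without rounding. The sketch below, in Python, applies four root-squarings to the equation $5x^3 + 2x^2 - 15x - 6 = 0$ in exact integer arithmetic; the helper name `graeffe_step` is hypothetical, and because nothing is rounded the later rows differ slightly in the last figures from a table kept to four or five digits.

```python
def graeffe_step(a):
    """One root-squaring step (hypothetical helper), exact for integer coefficients."""
    n = len(a) - 1
    b = []
    for k in range(n + 1):
        s = a[k] * a[k]
        sign, j = -1, 1
        while k - j >= 0 and k + j <= n:
            s += sign * 2 * a[k - j] * a[k + j]
            sign, j = -sign, j + 1
        b.append(s if k % 2 == 0 else -s)
    return b


rows = [[5, 2, -15, -6]]
for _ in range(4):                        # 2d, 4th, 8th, 16th powers
    rows.append(graeffe_step(rows[-1]))
p8, p16 = rows[3], rows[4]

# Second coefficient: the new value is about half the square of the old one.
ratio = abs(p16[1]) / p8[1] ** 2

# Equal pair from the quadratic fragment (product of the two equal roots),
# remaining root from the simple fragment.
x_equal = (p16[2] / p16[0]) ** (1 / 32)
x_single = abs(p16[3] / p16[2]) ** (1 / 16)

print(rows[1])
print(ratio, x_equal, x_single)
```

The ratio printed is very nearly one half, which is the mark of a pair of numerically equal roots, and the two magnitudes agree with the values 1.732 and 0.4 found above.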


90. Brodetsky and Smeal's Improvement of Graeffe's Method. The
Graeffe method as explained and illustrated up to this point is sufficient
for finding the real roots and one or two pairs of complex roots of an
algebraic equation. When an equation has three pairs of complex roots,
they can be found without much difficulty by making further use of the
relations between roots and coefficients; but since one of the real parts
(u's) must be found from a quadratic equation, thus giving two values
of u, the proper value must be determined by trial. When the given
equation has four or more pairs of complex roots, the practical difficulties
in finding them are almost insurmountable.

Brodetsky and Smeal * avoided all these difficulties by moving the origin
of x a small distance $\epsilon$ and then applying the root-squaring process to the
transformed equation. Their procedure enables all roots to be found
without any ambiguities and without much additional labor after the
roots of the transformed equation have been separated. The Brodetsky
and Smeal improvement enables any number of pairs of complex roots
to be found with the same ease as one or two pairs. The introduction of
the auxiliary variable $\epsilon$ more than doubles the labor of separating the
roots, but does not cause any other additional labor in the solution. We
present the Brodetsky and Smeal method in slightly modified form and
apply it to two examples.

Consider the general equation

$$x^n + a_1 x^{n-1} + a_2 x^{n-2} + \cdots + a_{n-1} x + a_n = 0 \tag{1}$$

and put

$$x = x' + \epsilon, \tag{2}$$

where $\epsilon$ denotes a small variable whose squares and higher powers are
negligible. On substituting (2) into (1), we have

$$(x' + \epsilon)^n + a_1 (x' + \epsilon)^{n-1} + a_2 (x' + \epsilon)^{n-2} + \cdots + a_{n-1}(x' + \epsilon) + a_n = 0. \tag{3}$$

Expanding the binomial quantities by the binomial theorem to only two
terms, thus neglecting all terms involving $\epsilon^2$, etc., we get

$$x'^n + n\epsilon x'^{n-1} + a_1[x'^{n-1} + (n-1)\epsilon x'^{n-2}] + a_2[x'^{n-2} + (n-2)\epsilon x'^{n-3}] + \cdots = 0,$$

or

$$x'^n + (a_1 + n\epsilon)x'^{n-1} + [a_2 + (n-1)a_1\epsilon]x'^{n-2} + \cdots = 0. \tag{4}$$

The root-squaring process is now applied to the transformed equation
(4) instead of the original equation (1). The root-squaring is carried
out by applying the rules stated in Art. 86, neglecting all $\epsilon^2$'s wherever
they occur. The rules must be applied to the entire coefficients (including
the $\epsilon$-terms) of the $x'$-terms. In the root-squaring table the terms con-
taining $\epsilon$ will be in rows below the rows of terms not containing $\epsilon$.

Before proceeding to work the illustrative examples, we digress to derive
the necessary formulas for finding the roots from the linear and quadratic
fragments. The fragments will contain $\epsilon$. When written as equations
according to (86.1), the linear fragments will be of the form

$$(A_1 + B_1\epsilon)\,x'^m + (A_2 + B_2\epsilon) = 0, \tag{5}$$

where m denotes the power to which the roots of (4) have been raised.



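From (5) we have

$$x'^m = -\frac{A_2 + B_2\epsilon}{A_1 + B_1\epsilon} = -\frac{A_2\left(1 + \frac{B_2}{A_2}\epsilon\right)}{A_1\left(1 + \frac{B_1}{A_1}\epsilon\right)} = -\frac{A_2}{A_1}\left[1 + \left(\frac{B_2}{A_2} - \frac{B_1}{A_1}\right)\epsilon\right],$$

where $\epsilon^2$ has been neglected in the division to get the expression in
brackets. On replacing $x'$ by $x - \epsilon$ from (2), we obtain

$$(x - \epsilon)^m = x^m - m x^{m-1}\epsilon = -\frac{A_2}{A_1} - \frac{A_2}{A_1}\left(\frac{B_2}{A_2} - \frac{B_1}{A_1}\right)\epsilon.$$

Since this is an identity in $\epsilon$, the coefficients of like powers of $\epsilon$ in the
two members of the identity are equal. Hence we have

$$x^m = -\frac{A_2}{A_1}, \tag{6}$$

$$m x^{m-1} = \frac{A_2}{A_1}\left(\frac{B_2}{A_2} - \frac{B_1}{A_1}\right). \tag{7}$$

Dividing (7) by (6) and solving for x, we obtain

$$x = \frac{-m}{\dfrac{B_2}{A_2} - \dfrac{B_1}{A_1}}. \tag{8}$$

The value of x should be found from both (6) and (8) as a check, the
latter giving the correct sign of x. If the two values do not agree closely,
there is some error in the root-squaring computation.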

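The quadratic fragments when equated to zero will be of the form

$$(A_1 + B_1\epsilon)y^2 + (A_2 + B_2\epsilon)y + (A_3 + B_3\epsilon) = 0, \tag{9}$$

or

$$y^2 + \frac{A_2 + B_2\epsilon}{A_1 + B_1\epsilon}\,y + \frac{A_3 + B_3\epsilon}{A_1 + B_1\epsilon} = 0, \tag{10}$$

where

$$y = (x')^m = (x - \epsilon)^m = (re^{\pm i\theta} - \epsilon)^m. \tag{11}$$

From (10) we have

$$y^2 + \frac{A_2\left(1 + \frac{B_2}{A_2}\epsilon\right)}{A_1\left(1 + \frac{B_1}{A_1}\epsilon\right)}\,y + \frac{A_3\left(1 + \frac{B_3}{A_3}\epsilon\right)}{A_1\left(1 + \frac{B_1}{A_1}\epsilon\right)} = 0,$$

or

$$y^2 + \frac{A_2}{A_1}\left[1 + \left(\frac{B_2}{A_2} - \frac{B_1}{A_1}\right)\epsilon\right]y + \frac{A_3}{A_1}\left[1 + \left(\frac{B_3}{A_3} - \frac{B_1}{A_1}\right)\epsilon\right] = 0, \tag{12}$$

where, in the division, we have neglected $\epsilon^2$ as before.

The product of the roots of (12) equals the constant term. Hence we
have

$$(re^{i\theta} - \epsilon)^m (re^{-i\theta} - \epsilon)^m = \frac{A_3}{A_1}\left[1 + \left(\frac{B_3}{A_3} - \frac{B_1}{A_1}\right)\epsilon\right].$$

The product on the left becomes $[r^2 - r\epsilon(e^{i\theta} + e^{-i\theta})]^m$ when $\epsilon^2$ is neglected.
Expanding this to two terms by the binomial theorem and replacing
$e^{i\theta} + e^{-i\theta}$ by $2\cos\theta$, we get

$$r^{2m} - 2m r^{2m-1}\epsilon\cos\theta = \frac{A_3}{A_1} + \frac{A_3}{A_1}\left(\frac{B_3}{A_3} - \frac{B_1}{A_1}\right)\epsilon. \tag{13}$$

Now equating coefficients of like powers of $\epsilon$ in both sides of the equation,
we obtain

$$r^{2m} = \frac{A_3}{A_1}, \tag{14}$$

$$-2m r^{2m-1}\cos\theta = \frac{A_3}{A_1}\left(\frac{B_3}{A_3} - \frac{B_1}{A_1}\right).$$

But $r\cos\theta = u$, the real part of the complex root. Hence we have

$$u = -\frac{r^2}{2m}\left(\frac{B_3}{A_3} - \frac{B_1}{A_1}\right). \tag{15}$$

Then

$$v = \sqrt{r^2 - u^2}. \tag{16}$$
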

The real roots of an equation are to be found from formulas (6) and 
(8), and the complex roots from (14), (15), and (16). 
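
The procedure can be sketched as a program in which every coefficient is carried as a pair (A, B) standing for $A + B\epsilon$, products being formed with $\epsilon^2$ dropped. The Python sketch below is an illustration under stated assumptions, not the computing schedule of the text: all function names are hypothetical, floating-point arithmetic replaces the rounded table, and the test equation is taken to be $x^5 + 3x^4 - x^3 - 2x^2 - 2x - 1 = 0$ with six squarings (m = 64). Which columns form linear or quadratic fragments is supplied by hand here, as it would be read from the table.

```python
import math


def shift_to_dual(a):
    """Coefficients of f(x' + eps) to first order, as pairs (A, B) meaning A + B*eps."""
    n = len(a) - 1
    return [(float(a[k]), float((n - k + 1) * a[k - 1]) if k > 0 else 0.0)
            for k in range(n + 1)]


def dual_mul(p, q):
    """Product of A1 + B1*eps and A2 + B2*eps with eps^2 neglected."""
    return (p[0] * q[0], p[0] * q[1] + p[1] * q[0])


def graeffe_step_dual(c):
    """One root-squaring step applied to the entire coefficients, eps-terms included."""
    n = len(c) - 1
    out = []
    for k in range(n + 1):
        A, B = dual_mul(c[k], c[k])
        sign, j = -1.0, 1
        while k - j >= 0 and k + j <= n:
            pa, pb = dual_mul(c[k - j], c[k + j])
            A += sign * 2.0 * pa
            B += sign * 2.0 * pb
            sign, j = -sign, j + 1
        s = 1.0 if k % 2 == 0 else -1.0
        out.append((s * A, s * B))
    return out


def ratio(c, k):
    return c[k][1] / c[k][0]                      # the quotient B/A of column k


def real_root(c, m, k):
    """Formula (8) for the linear fragment formed by columns k-1 and k."""
    return -m / (ratio(c, k) - ratio(c, k - 1))


def complex_pair(c, m, k):
    """Formulas (14), (15), (16) for the quadratic fragment with midterm in column k."""
    r2 = (c[k + 1][0] / c[k - 1][0]) ** (1.0 / m)
    u = -(r2 / (2.0 * m)) * (ratio(c, k + 1) - ratio(c, k - 1))
    return u, math.sqrt(r2 - u * u)


c = shift_to_dual([1, 3, -1, -2, -2, -1])
m = 1
for _ in range(6):
    c = graeffe_step_dual(c)
    m *= 2

x1 = real_root(c, m, 1)
x2 = real_root(c, m, 2)
u, v = complex_pair(c, m, 3)
x3 = real_root(c, m, 5)
print(x1, x2, x3, u, v)
```

Carrying the pair (A, B) through every product is the whole of the extra labor: the signs of the real roots and the real parts of the complex roots then come directly from the quotients B/A, with no trial substitution.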

Example 1. Find the roots of the equation

$$x^5 + 3x^4 - x^3 - 2x^2 - 2x - 1 = 0.$$

Solution. We first put $x = x' + \epsilon$. The equation becomes

$$(x' + \epsilon)^5 + 3(x' + \epsilon)^4 - (x' + \epsilon)^3 - 2(x' + \epsilon)^2 - 2(x' + \epsilon) - 1 = 0.$$

Expanding the binomials to two terms and rearranging the coefficients,
we find

$$x'^5 + (3 + 5\epsilon)x'^4 - (1 - 12\epsilon)x'^3 - (2 + 3\epsilon)x'^2 - (2 + 4\epsilon)x' - (1 + 2\epsilon) = 0.$$

The root-squaring of this equation is shown in full in the table on
pages 250-251.

The table shows that the roots are completely separated by the 6th
squaring (64th power). The table also shows that there are three real
roots and one pair of complex roots, the midterm of the quadratic frag-
ment being in the column under $x'^2$.

Since the 6th is the last squaring to be made, the parts of the coefficients
not containing $\epsilon$'s and the parts multiplied by $\epsilon$ are placed in separate
lines, the first line being designated A and the latter B. The quotients
$B/A$ are placed in a third line for use in the formulas for finding the roots.

To find the first real root, we substitute into formula (6) the first two
terms in line A and thereby obtain

$$x_1^{64} = 1.1528 \times 10^{32}.$$

Hence

$$
\begin{aligned}
64 \log x_1 &= 32 + \log 1.1528 = 32.06175,\\
\log x_1 &= 0.50096,\\
x_1 &= \pm 3.1693.
\end{aligned}
$$

Now substituting the ratios $\dfrac{B_1}{A_1}$ and $\dfrac{B_2}{A_2}$ into formula (8), we get

$$x_1 = \frac{-64}{20.194 - 0} = -3.1693.$$

This result agrees with that found above and also gives the sign of $x_1$.

To find the second real root, we substitute into (6) the second and
third terms of line A and thus have

$$x_2^{64} = \frac{1.74874 \times 10^{36}}{1.1528 \times 10^{32}} = \frac{1.74874 \times 10^{4}}{1.1528}.$$

Then

$$64 \log x_2 = 4 + \log 1.74874 - \log 1.1528 = 4.18098$$

and

$$
\begin{aligned}
\log x_2 &= 0.06533,\\
x_2 &= \pm 1.1623.
\end{aligned}
$$



[Table, pages 250-251: root-squaring of the transformed equation of Example 1, carried to the 64th powers of the roots, with the final lines A, B, and B/A. First line (given equation): $1$, $(3 + 5\epsilon)$, $-(1 - 12\epsilon)$, $-(2 + 3\epsilon)$, $-(2 + 4\epsilon)$, $-(1 + 2\epsilon)$.]



Then from (8) we get

$$x_2 = \frac{-64}{-34.867 - 20.194} = 1.1623,$$

which shows that this root is positive.

The third root is found from the last two terms of the lines A, B,
and $B/A$. We thus have

$$x_3^{64} = \frac{1}{2.10729 \times 10^{14}}$$

and

$$
\begin{aligned}
64 \log x_3 &= -14 - \log 2.10729 = -14.32372,\\
\log x_3 &= -\frac{14.32372}{64} = -0.22381 = 9.77619 - 10,\\
x_3 &= \pm 0.5973.
\end{aligned}
$$

From (8) we have

$$x_3 = \frac{-64}{128 - 20.913} = -0.5976.$$

The complex roots are found from the third and fifth terms in rows
A and $B/A$. Substituting the appropriate quantities into (14), we have

$$r^{128} = \frac{2.10729 \times 10^{14}}{1.74874 \times 10^{36}} = \frac{2.10729}{1.74874 \times 10^{22}},$$

$$
\begin{aligned}
\log r^2 &= \frac{1}{64}(\log 2.10729 - 22 - \log 1.74874)\\
&= -0.34248 = 9.65752 - 10,\\
r^2 &= 0.4545.
\end{aligned}
$$

Then from (15) and (16) we get

$$u = -\frac{0.4545}{128}(20.913 + 34.867) = -0.1981,$$

$$v = \sqrt{0.4545 - 0.0392} = 0.6444.$$

Hence the complex roots are

$$-0.1981 \pm 0.6444i.$$

The five roots of the given equation are therefore

$$-3.1693, \quad -0.5976, \quad 1.1623, \quad\text{and}\quad -0.1981 \pm 0.6444i.$$

The sum of these roots is -3.0008 and their product is 1. The solu-
tions found are therefore correct.

Example 2. The following equation arose in the investigation of the
stability of a certain type of airplane.* Its solution is required.

$$x^8 + 20.4x^7 + 151.3x^6 + 490x^5 + 687x^4 + 719x^3 + 150x^2 + 109x + 6.87 = 0.$$

Solution. Putting $x = x' + \epsilon$ and substituting this into the given
equation, we have

$$
\begin{aligned}
(x' + \epsilon)^8 + 20.4(x' + \epsilon)^7 + 151.3(x' + \epsilon)^6 + 490(x' + \epsilon)^5 + 687(x' + \epsilon)^4 &\\
{} + 719(x' + \epsilon)^3 + 150(x' + \epsilon)^2 + 109(x' + \epsilon) + 6.87 &= 0.
\end{aligned}
$$

On expanding the binomials to two terms and rearranging the coefficients,
we get

$$
\begin{aligned}
x'^8 + (20.4 + 8\epsilon)x'^7 + (151.3 + 142.8\epsilon)x'^6 + (490 + 907.8\epsilon)x'^5 &\\
{} + (687 + 2450\epsilon)x'^4 + (719 + 2748\epsilon)x'^3 + (150 + 2157\epsilon)x'^2 &\\
{} + (109 + 300\epsilon)x' + 6.87 + 109\epsilon &= 0.
\end{aligned}
$$

The successive steps in the root-squaring of this transformed equation are 
shown in the table on p. 254. To save space, only the successive equations 
are shown, with the exception of the last squaring. The complete results 
of the fifth squaring (32nd power) are shown to enable the reader to 
distinguish at a glance the linear and quadratic fragments. Under the 
middle term of each quadratic fragment is written the abbreviation “cos.” 
The table shows that there are three pairs of complex roots and two real 
roots. 


The first real root is found by formulas (6) and (8) from the first
two terms in lines A and $B/A$. We have

$$
\begin{aligned}
x_1^{32} &= 3.2920 \times 10^{28},\\
32 \log x_1 &= 28 + \log 3.292 = 28.51746,\\
\log x_1 &= 0.89117,\\
x_1 &= \pm 7.783.
\end{aligned}
$$





















































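[Table, p. 254: successive equations in the root-squaring of the transformed equation of Example 2, with the complete results of the fifth squaring (32d power) and the lines A, B, and B/A.]

Also, from (8),

$$x_1 = \frac{-32}{4.1236 - 0} = -7.76.$$

Hence the required root is

$$x_1 = -7.78.$$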


The second real root is found from the last two terms in lines A and
$B/A$. We have

$$
\begin{aligned}
x_2^{32} &= \frac{0.605 \times 10^{27}}{1.8572 \times 10^{64}} = \frac{6.05}{1.8572 \times 10^{38}},\\
32 \log x_2 &= \log 6.05 - 38 - \log 1.8572 = -37.4871,\\
\log x_2 &= -1.1715 = 8.8285 - 10,\\
x_2 &= \pm 0.06738.
\end{aligned}
$$

We take

$$x_2 = \frac{-32}{508.25 - 32.815} = -0.0674.$$


From the first quadratic fragment and formulas (14), (15), (16),
we have

$$
\begin{aligned}
r_1^{64} &= \frac{-0.8317 \times 10^{78}}{-3.2920 \times 10^{28}} = \frac{8.317 \times 10^{49}}{3.292},\\
\log r_1^2 &= \frac{1}{32}(49 + \log 8.317 - \log 3.292) = 1.54383,\\
r_1^2 &= 34.981,\\
u_1 &= -\frac{34.981}{64}(14.377 - 4.1236) = -5.604,\\
v_1 &= \sqrt{34.981 - 31.4048} = 1.891.
\end{aligned}
$$

Hence the first pair of complex roots is

$$-5.604 \pm 1.891i.$$

For the second pair of complex roots we have

$$
\begin{aligned}
r_2^{64} &= \frac{-6.7553 \times 10^{88}}{-0.8317 \times 10^{78}} = \frac{6.7553 \times 10^{11}}{8.317},\\
\log r_2^2 &= \frac{1}{32}(11 + \log 6.7553 - \log 8.317) = 0.34093,\\
r_2^2 &= 2.19245,\\
u_2 &= -\frac{2.19245}{64}(33.871 - 14.377) = -0.6678,\\
v_2 &= \sqrt{2.19245 - 0.44596} = 1.322.
\end{aligned}
$$

The second pair of complex roots is then

$$-0.6678 \pm 1.322i.$$

For the third pair of complex roots we have

$$
\begin{aligned}
r_3^{64} &= \frac{-1.8572 \times 10^{64}}{-6.7553 \times 10^{88}} = \frac{1.8572}{6.7553 \times 10^{24}},\\
\log r_3^2 &= \frac{1}{32}(\log 1.8572 - 24 - \log 6.7553) = -0.76752 = 9.23248 - 10,\\
r_3^2 &= 0.1708,\\
u_3 &= -\frac{0.1708}{64}(32.815 - 33.871) = 0.002818,\\
v_3 &= \sqrt{0.1708 - (0.002818)^2} = 0.413.
\end{aligned}
$$

The third pair of complex roots is thus

$$0.002818 \pm 0.413i.$$

The sum of the roots we have found is -20.39 and their product
is 6.872.

The Brodetsky and Smeal method is elegant and powerful, and enables
the roots of equations of any degree to be found uniquely ; but it has the 
disadvantage of greatly increasing the labor of separating the roots, with 
a corresponding increase in the possibility of errors in the root-squaring 
operation. Errors are most likely to occur in computing and recording 
the doubled products — by forgetting to multiply by 2, by recording the 
results with the wrong sign, etc. In a long computation such errors can 
be prevented only by the utmost vigilance on the part of the computer. 

For the reasons stated above, the Brodetsky-Smeal method is not advised
for solving equations having fewer than three pairs of complex roots.
When an equation of the sixth or higher degree is to be solved, it is well
to separate the roots first by the ordinary method, not using the $\epsilon$'s. Then
if it is seen that there are three or more pairs of complex roots, the given
equation should be transformed to $x'$'s and $\epsilon$'s and the root-squaring
repeated. The first root-squaring can be used as a guide and partial
check on the second.


91. Improving the Accuracy of the Roots. The accuracy of the 
roots found by the Graeffe method can be improved to any desired extent 
by several methods, one of the best being the Newton-Raphson method. 
Although that method was applied only to the computation of real roots 
in Art. 71, it is equally applicable to the improvement of complex roots, 
as we shall now show. 

A function $f(z)$ of a complex variable can be represented about a point $z_0$
in the complex plane by the Taylor series

$$f(z) = f(z_0) + f'(z_0)(z - z_0) + \frac{f''(z_0)}{2!}(z - z_0)^2 + \cdots + \frac{f^{(n)}(z_0)}{n!}(z - z_0)^n + \cdots. \tag{1}$$

If we put $z - z_0 = h$ or $z = z_0 + h$, where h is now complex, the series
(1) becomes

$$f(z_0 + h) = f(z_0) + h f'(z_0) + \frac{h^2}{2!} f''(z_0) + \cdots + \frac{h^n}{n!} f^{(n)}(z_0) + \cdots. \tag{2}$$

If $z_0$ is an approximate value of a root of the equation $f(z) = 0$ and
h is a correction such that $f(z_0 + h) = 0$, then (2) becomes

$$f(z_0) + h f'(z_0) + \frac{h^2}{2!} f''(z_0) + \cdots = 0. \tag{3}$$

Since the modulus of h is assumed to be small, we may neglect in (3) all
terms higher than the first degree in h. Then we get

$$h = -\frac{f(z_0)}{f'(z_0)}, \tag{4}$$

which is the Newton-Raphson formula.
The improved value of z is then

$$z_1 = z_0 - \frac{f(z_0)}{f'(z_0)}. \tag{5}$$


Example. Approximate values of the complex roots of the equation

$$f(z) = z^4 - 3z^3 + 8z^2 - 5 = 0$$

are

$$z_0 = 1.39 \pm 2.47i.$$

Find more accurate values of these roots.

Solution. Since the imaginary parts of complex roots are always
numerically equal and of opposite sign, it suffices to improve only the
root with positive imaginary part. Hence we take

$$z_0 = 1.39 + 2.47i.$$

Then since

$$f'(z) = 4z^3 - 9z^2 + 16z,$$

we have

$$f(z_0) = (1.39 + 2.47i)^4 - 3(1.39 + 2.47i)^3 + 8(1.39 + 2.47i)^2 - 5,$$

$$f'(z_0) = 4(1.39 + 2.47i)^3 - 9(1.39 + 2.47i)^2 + 16(1.39 + 2.47i).$$

The right-hand members of these equations can be evaluated by direct
expansions by the binomial theorem, but they can be evaluated with less
labor by changing the parenthetical terms to polar coordinate form and
then using the relation

$$(a + bi)^n = r^n(\cos n\theta + i \sin n\theta).$$

Using the latter method, we have

$$r = \sqrt{(1.39)^2 + (2.47)^2} = 2.83425475,$$

$$\sin\theta = \frac{2.47}{r} = 0.871481295,$$

$$\cos\theta = \frac{1.39}{r} = 0.490428145,$$

$$\theta = 60^\circ\, 37'\, 52.44''.$$

Then we find

$$f(z_0) = 0.1437 - 0.061074i,$$

$$f'(z_0) = -31.261339 - 25.288849i,$$

$$h = -\frac{0.1437 - 0.061074i}{-31.261339 - 25.288849i} = \frac{0.1437 - 0.061074i}{31.26 + 25.29i} = 0.0018 - 0.0034i.$$

The corrected value of z is therefore

$$z = 1.3918 + 2.4666i$$

and the improved complex roots are

$$z = 1.3918 \pm 2.4666i.$$
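The correction formula (4) can be tried directly with Python's built-in complex numbers. The following is only a minimal sketch: the names `f`, `df`, `newton_step`, and `improve_root` are illustrative and not taken from the text, and the polynomial is assumed to be the one of the example above.

```python
def f(z):
    """Polynomial of the example: z^4 - 3z^3 + 8z^2 - 5."""
    return z**4 - 3 * z**3 + 8 * z**2 - 5


def df(z):
    """Its derivative: 4z^3 - 9z^2 + 16z."""
    return 4 * z**3 - 9 * z**2 + 16 * z


def newton_step(z0):
    """Apply the correction once: z0 + h with h = -f(z0)/f'(z0)."""
    h = -f(z0) / df(z0)
    return z0 + h


def improve_root(z0, tol=1e-12, max_iter=50):
    """Repeat the correction until the modulus of h falls below tol."""
    z = z0
    for _ in range(max_iter):
        h = -f(z) / df(z)
        z = z + h
        if abs(h) < tol:
            break
    return z


if __name__ == "__main__":
    print(newton_step(1.39 + 2.47j))   # one correction, as in the example
    print(improve_root(1.39 + 2.47j))  # corrections repeated to convergence
```

Because the coefficients are real, the root with negative imaginary part is the conjugate of the one computed, so only one of the pair needs to be improved.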





Carvallo * has extended Graeffe's method to the solution of transcendental
equations by expanding the equation into a Taylor series, neglecting the
remainder term, and then treating the resulting polynomial as an algebraic
equation.

Note. If the leading coefficient of an equation is not unity, time and 
labor will be saved by first dividing throughout by the leading coefficient 
before beginning the root-squaring process. 

EXERCISES XI 

Find all the roots of the following equations: 

1. -f 7.18** 19.41a:* + 1.83* -1-2 — 0. 

8. 2.7662117** — 0.11123B89*-!- 1.6967115 — 0. 

8. 8.26** + 4.2** -1- 3.08** — 7.16*» -f 1.92* — 7.76 — 0. 

4. *• — 6*»-f 3**-l-6** — 6X-I-2 — 0. 

5. **-1-3** -1-6 — 0. 

6. *• + 7.73** + 12.84**— 1.111*» — 55.7** — 125.3** 

— 157.9** — 112.3* — 56.3 — 0. 


* Résolution Numérique des Équations, p. 24.



CHAPTER XII 


NUMERICAL SOLUTION OF SIMULTANEOUS LINEAR
EQUATIONS

Various methods have been devised for the numerical solution of simultaneous
linear equations. Some of the methods are of general applicability,
while others are somewhat restricted in their application. Perhaps no single
method is best in all cases. In the present chapter some of the best general
methods are explained and illustrated by numerical examples.


I. SOLUTION BY DETERMINANTS


92. Evaluation of Numerical Determinants. In the following pages
it is assumed that the reader is familiar with the elementary properties of
determinants to the extent given in the usual college algebras.

a) Expansion in terms of minors. The minor of any element of a 
determinant is the determinant which remains after deletion of the row 
and column containing the element. Any determinant may be expanded 
in terms of the minors of the elements of any row or column. The elements 
of the first row or first column are usually the most convenient for expan- 
sions in terms of minors. Thus, the determinant 


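$$D = \begin{vmatrix} a_1 & a_2 & a_3 & a_4 \\ b_1 & b_2 & b_3 & b_4 \\ c_1 & c_2 & c_3 & c_4 \\ d_1 & d_2 & d_3 & d_4 \end{vmatrix}$$

may be expanded with respect to the elements of the first row as

$$D = a_1 \begin{vmatrix} b_2 & b_3 & b_4 \\ c_2 & c_3 & c_4 \\ d_2 & d_3 & d_4 \end{vmatrix} - a_2 \begin{vmatrix} b_1 & b_3 & b_4 \\ c_1 & c_3 & c_4 \\ d_1 & d_3 & d_4 \end{vmatrix} + a_3 \begin{vmatrix} b_1 & b_2 & b_4 \\ c_1 & c_2 & c_4 \\ d_1 & d_2 & d_4 \end{vmatrix} - a_4 \begin{vmatrix} b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \\ d_1 & d_2 & d_3 \end{vmatrix}$$

or with respect to the elements of the first column as

$$D = a_1 \begin{vmatrix} b_2 & b_3 & b_4 \\ c_2 & c_3 & c_4 \\ d_2 & d_3 & d_4 \end{vmatrix} - b_1 \begin{vmatrix} a_2 & a_3 & a_4 \\ c_2 & c_3 & c_4 \\ d_2 & d_3 & d_4 \end{vmatrix} + c_1 \begin{vmatrix} a_2 & a_3 & a_4 \\ b_2 & b_3 & b_4 \\ d_2 & d_3 & d_4 \end{vmatrix} - d_1 \begin{vmatrix} a_2 & a_3 & a_4 \\ b_2 & b_3 & b_4 \\ c_2 & c_3 & c_4 \end{vmatrix}.$$

The resulting third-order determinants can be expanded by the rule for
expanding determinants of the second and third orders.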



Note that in the above expansions by minors the signs alternate, whether 
the expansion is with respect to the elements of the row or with respect 
to the elements of the column. 

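Example. To evaluate the determinant

$$D = \begin{vmatrix} 3 & -1 & 2 & 5 \\ -2 & 3 & 4 & 1 \\ 1 & -3 & 2 & 6 \\ 2 & 4 & 3 & 1 \end{vmatrix},$$

we expand it in terms of the elements of the first row and have

$$D = 3 \begin{vmatrix} 3 & 4 & 1 \\ -3 & 2 & 6 \\ 4 & 3 & 1 \end{vmatrix} + 1 \begin{vmatrix} -2 & 4 & 1 \\ 1 & 2 & 6 \\ 2 & 3 & 1 \end{vmatrix} + 2 \begin{vmatrix} -2 & 3 & 1 \\ 1 & -3 & 6 \\ 2 & 4 & 1 \end{vmatrix} - 5 \begin{vmatrix} -2 & 3 & 4 \\ 1 & -3 & 2 \\ 2 & 4 & 3 \end{vmatrix}$$

$$= 129 + 75 + 194 - 385 = 13.$$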


Although the method of expansion by minors is simple and is very 
important theoretically, it is too long for practical use in determinants 
higher than the fourth order. 
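The expansion by minors translates almost word for word into a recursive routine. The sketch below is illustrative only; the function name `det_by_minors` is an assumption, not something from the text. Its cost grows like n!, which is one way to see why the method is impractical beyond small orders.

```python
def det_by_minors(m):
    """Expand a square matrix (list of lists) along its first row."""
    n = len(m)
    if n == 1:
        return m[0][0]
    total = 0
    for j, element in enumerate(m[0]):
        # Minor: delete the first row and the j-th column.
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        # Signs alternate +, -, +, ... along the row.
        total += (-1) ** j * element * det_by_minors(minor)
    return total


if __name__ == "__main__":
    d = [[3, -1, 2, 5],
         [-2, 3, 4, 1],
         [1, -3, 2, 6],
         [2, 4, 3, 1]]
    print(det_by_minors(d))
```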

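b) The pivotal method. In this method of evaluating a determinant
the order of the determinant is systematically reduced step by step, the
leading element (called the pivot) playing a more important role than any
other element. To derive the formula for the pivotal expansion, let us
consider the nth-order determinant

$$D = \begin{vmatrix} a_1 & a_2 & a_3 & a_4 & \cdots \\ b_1 & b_2 & b_3 & b_4 & \cdots \\ c_1 & c_2 & c_3 & c_4 & \cdots \\ d_1 & d_2 & d_3 & d_4 & \cdots \\ \vdots & & & & \end{vmatrix}. \tag{1}$$

Multiply the elements of the 2d, 3d, ..., nth columns by $a_1$ and compensate
by dividing the determinant by $a_1^{n-1}$. Then

$$D = \frac{1}{a_1^{n-1}} \begin{vmatrix} a_1 & a_1a_2 & a_1a_3 & a_1a_4 & \cdots \\ b_1 & a_1b_2 & a_1b_3 & a_1b_4 & \cdots \\ c_1 & a_1c_2 & a_1c_3 & a_1c_4 & \cdots \\ d_1 & a_1d_2 & a_1d_3 & a_1d_4 & \cdots \\ \vdots & & & & \end{vmatrix}. \tag{2}$$

Now multiply the elements of the first column of (1) by $a_2, a_3, \ldots$ in
succession and subtract the results from the 2d, 3d, ..., nth columns of
(2). We thus get

$$D = \frac{1}{a_1^{n-1}} \begin{vmatrix} a_1 & 0 & 0 & 0 & \cdots \\ b_1 & a_1b_2 - a_2b_1 & a_1b_3 - a_3b_1 & a_1b_4 - a_4b_1 & \cdots \\ c_1 & a_1c_2 - a_2c_1 & a_1c_3 - a_3c_1 & a_1c_4 - a_4c_1 & \cdots \\ d_1 & a_1d_2 - a_2d_1 & a_1d_3 - a_3d_1 & a_1d_4 - a_4d_1 & \cdots \\ \vdots & & & & \end{vmatrix},$$

which becomes

$$D = \frac{a_1}{a_1^{n-1}} \begin{vmatrix} a_1b_2 - a_2b_1 & a_1b_3 - a_3b_1 & a_1b_4 - a_4b_1 & \cdots \\ a_1c_2 - a_2c_1 & a_1c_3 - a_3c_1 & a_1c_4 - a_4c_1 & \cdots \\ a_1d_2 - a_2d_1 & a_1d_3 - a_3d_1 & a_1d_4 - a_4d_1 & \cdots \\ \vdots & & & \end{vmatrix}$$

when expanded by minors with respect to the elements of the first row.
We have thus derived the important reduction formula:

$$\begin{vmatrix} a_1 & a_2 & a_3 & a_4 & \cdots \\ b_1 & b_2 & b_3 & b_4 & \cdots \\ c_1 & c_2 & c_3 & c_4 & \cdots \\ d_1 & d_2 & d_3 & d_4 & \cdots \\ \vdots & & & & \end{vmatrix} = \frac{1}{a_1^{n-2}} \begin{vmatrix} a_1b_2 - a_2b_1 & a_1b_3 - a_3b_1 & a_1b_4 - a_4b_1 & \cdots \\ a_1c_2 - a_2c_1 & a_1c_3 - a_3c_1 & a_1c_4 - a_4c_1 & \cdots \\ a_1d_2 - a_2d_1 & a_1d_3 - a_3d_1 & a_1d_4 - a_4d_1 & \cdots \\ \vdots & & & \end{vmatrix}. \tag{3}$$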


Since each application of (3) lowers the order of a determinant by one, 
a continued application of it will reduce any determinant to the third 
order or even to the second order. 

Formula (3) shows that the first column of the new determinant of
lower order is obtained by multiplying by $a_1$ each element in the first
column of the minor of $a_1$ and then subtracting the product of the element
at the top by the element at the extreme left. The elements in the second
column of the new determinant are obtained in the same manner. This
procedure should be memorized and applied as a working rule: $a_1$ times
any element, minus the element at the top times the element at the
extreme left.

Example. Applying this rule to the numerical determinant evaluated
above by minors, we have

$$\begin{vmatrix} 3 & -1 & 2 & 5 \\ -2 & 3 & 4 & 1 \\ 1 & -3 & 2 & 6 \\ 2 & 4 & 3 & 1 \end{vmatrix} = \frac{1}{9} \begin{vmatrix} 7 & 16 & 13 \\ -8 & 4 & 13 \\ 14 & 5 & -7 \end{vmatrix}$$

$$= \frac{1}{9}(-196 + 2912 - 520 - 728 - 455 - 896) = \frac{117}{9} = 13.$$
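The working rule ("the pivot times any element, minus the element at the top times the element at the extreme left") can be sketched as a short routine that applies the reduction repeatedly until a single number remains. The function name `det_pivotal` and the use of exact fractions are illustrative assumptions; the row interchange for a zero pivot is an addition not discussed in the text.

```python
from fractions import Fraction


def det_pivotal(m):
    """Evaluate a determinant by repeated pivotal reduction."""
    m = [[Fraction(v) for v in row] for row in m]
    scale = Fraction(1)
    while len(m) > 1:
        n = len(m)
        if m[0][0] == 0:
            # Assumption: bring a nonzero element to the pivot position by a
            # row interchange, which changes the sign of the determinant.
            for r in range(1, n):
                if m[r][0] != 0:
                    m[0], m[r] = m[r], m[0]
                    scale = -scale
                    break
            else:
                return Fraction(0)  # whole first column is zero
        a1 = m[0][0]
        # Each reduction divides by the pivot raised to the power n - 2.
        scale /= a1 ** (n - 2)
        # New element = pivot * element - (element at top) * (element at left).
        m = [[a1 * m[i][j] - m[0][j] * m[i][0] for j in range(1, n)]
             for i in range(1, n)]
    return scale * m[0][0]


if __name__ == "__main__":
    d = [[3, -1, 2, 5],
         [-2, 3, 4, 1],
         [1, -3, 2, 6],
         [2, 4, 3, 1]]
    print(det_pivotal(d))
```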

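c) The triangular method. Another important method of evaluating
a numerical determinant is to reduce it to triangular form and then take
the product of the elements in the leading diagonal of the triangular
determinant. A triangular determinant is one in which all the elements
on one side of the leading diagonal are zero. Any determinant

$$D = \begin{vmatrix} a_{11} & a_{12} & a_{13} & \cdots & a_{1n} \\ a_{21} & a_{22} & a_{23} & \cdots & a_{2n} \\ a_{31} & a_{32} & a_{33} & \cdots & a_{3n} \\ \vdots & & & & \vdots \\ a_{n1} & a_{n2} & a_{n3} & \cdots & a_{nn} \end{vmatrix} \tag{4}$$

can be reduced to triangular form by the following procedure:

Multiply the first row successively by $\dfrac{a_{21}}{a_{11}}, \dfrac{a_{31}}{a_{11}}, \ldots, \dfrac{a_{n1}}{a_{11}}$ and subtract the
results from the 2d, 3d, ..., nth rows. We thus obtain

$$D = \begin{vmatrix} a_{11} & a_{12} & a_{13} & \cdots & a_{1n} \\ 0 & b_{22} & b_{23} & \cdots & b_{2n} \\ 0 & b_{32} & b_{33} & \cdots & b_{3n} \\ 0 & b_{42} & b_{43} & \cdots & b_{4n} \\ \vdots & & & & \vdots \\ 0 & b_{n2} & b_{n3} & \cdots & b_{nn} \end{vmatrix}, \tag{5}$$

where

$$b_{22} = a_{22} - \frac{a_{21}}{a_{11}}a_{12}, \quad b_{23} = a_{23} - \frac{a_{21}}{a_{11}}a_{13}, \quad \text{etc.},$$

$$b_{32} = a_{32} - \frac{a_{31}}{a_{11}}a_{12}, \quad b_{33} = a_{33} - \frac{a_{31}}{a_{11}}a_{13}, \quad \text{etc.}$$

Now leaving the first row of (5) as it stands, we multiply the second
row successively by $\dfrac{b_{32}}{b_{22}}, \dfrac{b_{42}}{b_{22}}, \ldots, \dfrac{b_{n2}}{b_{22}}$ and subtract the results from the
3d, 4th, ..., nth rows of (5) and thereby obtain

$$D = \begin{vmatrix} a_{11} & a_{12} & a_{13} & \cdots & a_{1n} \\ 0 & b_{22} & b_{23} & \cdots & b_{2n} \\ 0 & 0 & c_{33} & \cdots & c_{3n} \\ 0 & 0 & c_{43} & \cdots & c_{4n} \\ \vdots & & & & \vdots \\ 0 & 0 & c_{n3} & \cdots & c_{nn} \end{vmatrix}, \tag{6}$$

where

$$c_{33} = b_{33} - \frac{b_{32}}{b_{22}}b_{23}, \quad \text{etc.}, \qquad c_{43} = b_{43} - \frac{b_{42}}{b_{22}}b_{23}, \quad \text{etc.}$$

By continuing this process we reduce (4) to the triangular form

$$D = \begin{vmatrix} a_{11} & a_{12} & a_{13} & a_{14} & \cdots & a_{1n} \\ 0 & b_{22} & b_{23} & b_{24} & \cdots & b_{2n} \\ 0 & 0 & c_{33} & c_{34} & \cdots & c_{3n} \\ 0 & 0 & 0 & d_{44} & \cdots & d_{4n} \\ \vdots & & & & & \vdots \\ 0 & 0 & 0 & 0 & \cdots & l_{nn} \end{vmatrix}. \tag{7}$$

The value of the original determinant (4) is the product of the diagonal
elements of (7), or

$$D = a_{11} b_{22} c_{33} d_{44} \cdots l_{nn}. \tag{8}$$

To prove this fact we expand (7) successively in minors with respect
to the elements of the first column of each successive determinant. Thus:

$$D = a_{11} \begin{vmatrix} b_{22} & b_{23} & \cdots & b_{2n} \\ 0 & c_{33} & \cdots & c_{3n} \\ 0 & 0 & \cdots & d_{4n} \\ \vdots & & & \vdots \\ 0 & 0 & \cdots & l_{nn} \end{vmatrix} = a_{11} b_{22} \begin{vmatrix} c_{33} & c_{34} & \cdots & c_{3n} \\ 0 & d_{44} & \cdots & d_{4n} \\ \vdots & & & \vdots \\ 0 & 0 & \cdots & l_{nn} \end{vmatrix},$$

and continue the expansions until we arrive at (8).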





If the elements of the determinant are inexact or rounded numbers,
the denominators $a_{11}$, $b_{22}$, etc. of the fractional multipliers should be as
large as possible in order to reduce errors as much as possible. Interchanges
of rows or columns will usually enable the computer to bring the
largest elements to the positions of $a_{11}$, $b_{22}$, etc. (each such interchange
changes the sign of the determinant).


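Example. We now apply the triangular method to the determinant previously
evaluated by other methods.

$$D = \begin{vmatrix} 3 & -1 & 2 & 5 \\ -2 & 3 & 4 & 1 \\ 1 & -3 & 2 & 6 \\ 2 & 4 & 3 & 1 \end{vmatrix}.$$

On multiplying the first row successively by $\frac{2}{3}$, $\frac{1}{3}$, $\frac{2}{3}$, then adding the
first result to the second row and subtracting the next two results from
the third and fourth rows, respectively, we get

$$D = \begin{vmatrix} 3 & -1 & 2 & 5 \\ 0 & \frac{7}{3} & \frac{16}{3} & \frac{13}{3} \\ 0 & -\frac{8}{3} & \frac{4}{3} & \frac{13}{3} \\ 0 & \frac{14}{3} & \frac{5}{3} & -\frac{7}{3} \end{vmatrix}.$$

Now multiplying the second row by $\frac{8}{7}$ and 2 in succession, adding the
first result to the third row and subtracting the second result from the
fourth row, we get

$$D = \begin{vmatrix} 3 & -1 & 2 & 5 \\ 0 & \frac{7}{3} & \frac{16}{3} & \frac{13}{3} \\ 0 & 0 & \frac{52}{7} & \frac{65}{7} \\ 0 & 0 & -9 & -11 \end{vmatrix}.$$

Finally, on multiplying the third row by $\frac{63}{52}$ and adding the result to
the fourth row, we obtain

$$D = \begin{vmatrix} 3 & -1 & 2 & 5 \\ 0 & \frac{7}{3} & \frac{16}{3} & \frac{13}{3} \\ 0 & 0 & \frac{52}{7} & \frac{65}{7} \\ 0 & 0 & 0 & \frac{1}{4} \end{vmatrix}.$$

Hence

$$D = 3 \times \frac{7}{3} \times \frac{52}{7} \times \frac{1}{4} = 13.$$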
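The reduction to triangular form can be sketched as follows. This is an illustrative routine only: the name `det_triangular` is an assumption, exact fractions are used so that the worked example is reproduced without rounding, and the largest available element of each column is brought to the diagonal by a row interchange, with the sign change that each interchange entails.

```python
from fractions import Fraction


def det_triangular(m):
    """Reduce a square matrix to upper-triangular form and multiply the diagonal."""
    a = [[Fraction(v) for v in row] for row in m]
    n = len(a)
    sign = 1
    for col in range(n):
        # Choose the numerically largest element in this column as denominator.
        p = max(range(col, n), key=lambda r: abs(a[r][col]))
        if a[p][col] == 0:
            return Fraction(0)
        if p != col:
            a[col], a[p] = a[p], a[col]
            sign = -sign  # a row interchange changes the sign
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            a[r] = [x - factor * y for x, y in zip(a[r], a[col])]
    product = Fraction(sign)
    for i in range(n):
        product *= a[i][i]
    return product


if __name__ == "__main__":
    d = [[3, -1, 2, 5],
         [-2, 3, 4, 1],
         [1, -3, 2, 6],
         [2, 4, 3, 1]]
    print(det_triangular(d))
```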


Concerning the merits of the three methods given above for evaluating
a numerical determinant, the method of expansion in terms of minors is
the least desirable of the three. As to the other two methods, the pivotal
method is preferable when the elements of a determinant are simple
numbers of one or two digits such that they can be multiplied and subtracted
mentally. When the elements are numbers of several digits, the
triangular method is the best method of evaluation.


93. Cramer's Rule. A simple method of solving simultaneous linear
equations by determinants was discovered by Gabriel Cramer* in 1750.
To derive Cramer’s rule, as it is called, we consider a system of three 
equations 


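$$\begin{cases} a_1x + a_2y + a_3z = k_1, \\ b_1x + b_2y + b_3z = k_2, \\ c_1x + c_2y + c_3z = k_3. \end{cases} \tag{1}$$

We first write down arbitrarily the determinant

$$\begin{vmatrix} (a_1x + a_2y + a_3z) & a_2 & a_3 \\ (b_1x + b_2y + b_3z) & b_2 & b_3 \\ (c_1x + c_2y + c_3z) & c_2 & c_3 \end{vmatrix}. \tag{2}$$

From (1) the elements in the first column of (2) are equal to $k_1$, $k_2$, $k_3$,
respectively. Hence we may write

$$\begin{vmatrix} (a_1x + a_2y + a_3z) & a_2 & a_3 \\ (b_1x + b_2y + b_3z) & b_2 & b_3 \\ (c_1x + c_2y + c_3z) & c_2 & c_3 \end{vmatrix} = \begin{vmatrix} k_1 & a_2 & a_3 \\ k_2 & b_2 & b_3 \\ k_3 & c_2 & c_3 \end{vmatrix}. \tag{3}$$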


* Swiss mathematician (1704-1752).





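The addition law of determinants states that the left member of (3)
may be expressed as the sum of three determinants, as follows:

$$\begin{vmatrix} a_1x & a_2 & a_3 \\ b_1x & b_2 & b_3 \\ c_1x & c_2 & c_3 \end{vmatrix} + \begin{vmatrix} a_2y & a_2 & a_3 \\ b_2y & b_2 & b_3 \\ c_2y & c_2 & c_3 \end{vmatrix} + \begin{vmatrix} a_3z & a_2 & a_3 \\ b_3z & b_2 & b_3 \\ c_3z & c_2 & c_3 \end{vmatrix}.$$

Factoring out x, y, and z from the first columns of these determinants and
replacing the left member of (3) by the above sum, we have

$$x \begin{vmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{vmatrix} + y \begin{vmatrix} a_2 & a_2 & a_3 \\ b_2 & b_2 & b_3 \\ c_2 & c_2 & c_3 \end{vmatrix} + z \begin{vmatrix} a_3 & a_2 & a_3 \\ b_3 & b_2 & b_3 \\ c_3 & c_2 & c_3 \end{vmatrix} = \begin{vmatrix} k_1 & a_2 & a_3 \\ k_2 & b_2 & b_3 \\ k_3 & c_2 & c_3 \end{vmatrix}.$$

Since two columns of the second and third determinants on the left are
identical, those determinants are each equal to zero and we are left with

$$x \begin{vmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{vmatrix} = \begin{vmatrix} k_1 & a_2 & a_3 \\ k_2 & b_2 & b_3 \\ k_3 & c_2 & c_3 \end{vmatrix},$$

from which

$$x = \frac{\begin{vmatrix} k_1 & a_2 & a_3 \\ k_2 & b_2 & b_3 \\ k_3 & c_2 & c_3 \end{vmatrix}}{\begin{vmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{vmatrix}}.$$

To find the value of y we write down the determinant

$$\begin{vmatrix} a_1 & (a_1x + a_2y + a_3z) & a_3 \\ b_1 & (b_1x + b_2y + b_3z) & b_3 \\ c_1 & (c_1x + c_2y + c_3z) & c_3 \end{vmatrix},$$

replace the elements in the second column by $k_1$, $k_2$, $k_3$ from (1), and then
proceed as in finding x. The result is

$$y = \frac{\begin{vmatrix} a_1 & k_1 & a_3 \\ b_1 & k_2 & b_3 \\ c_1 & k_3 & c_3 \end{vmatrix}}{\begin{vmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{vmatrix}}.$$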




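To find z we start with the determinant

$$\begin{vmatrix} a_1 & a_2 & (a_1x + a_2y + a_3z) \\ b_1 & b_2 & (b_1x + b_2y + b_3z) \\ c_1 & c_2 & (c_1x + c_2y + c_3z) \end{vmatrix}$$

and proceed as in the case of x. This gives

$$z = \frac{\begin{vmatrix} a_1 & a_2 & k_1 \\ b_1 & b_2 & k_2 \\ c_1 & c_2 & k_3 \end{vmatrix}}{\begin{vmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{vmatrix}}.$$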

Note that the numerators of the fractions giving x, y, and z are the
same as the denominator except that the coefficients of the desired unknown
are replaced by the known quantities (the k's). Hence Cramer's rule
may be stated as follows:

Write down the determinant of the coefficients of the unknowns. Any
unknown is equal to the fraction whose denominator is the determinant of
the coefficients and whose numerator is the same determinant with the
coefficients of the desired unknown replaced by the known (constant) terms.

Cramer's rule holds for systems of any number of equations in the
same number of unknowns and can be derived as was done above in the
case of three equations.

Example. Find y from the equations

$$\begin{cases} 3x + 2y - z + t = 1 \\ x - y - 2z + 4t = 3 \\ 2x + 3y + z - 2t = -2 \\ 5x - 2y + 3z + 2t = 0. \end{cases} \tag{A}$$

Solution. The denominator of the fractions for all the unknowns is

$$D = \begin{vmatrix} 3 & 2 & -1 & 1 \\ 1 & -1 & -2 & 4 \\ 2 & 3 & 1 & -2 \\ 5 & -2 & 3 & 2 \end{vmatrix} = \frac{1}{9} \begin{vmatrix} -5 & -5 & 11 \\ 5 & 5 & -8 \\ -16 & 14 & 1 \end{vmatrix}$$

$$= \frac{1}{9}(-25 - 640 + 770 + 880 + 25 - 560) = 50.$$

Hence

$$y = \frac{1}{50} \begin{vmatrix} 3 & 1 & -1 & 1 \\ 1 & 3 & -2 & 4 \\ 2 & -2 & 1 & -2 \\ 5 & 0 & 3 & 2 \end{vmatrix} = \frac{1}{50} \cdot \frac{1}{9} \begin{vmatrix} 8 & -5 & 11 \\ -8 & 5 & -8 \\ -5 & 14 & 1 \end{vmatrix} = -\frac{29}{50}.$$

The other unknowns are found to be $x = \frac{19}{50}$, $z = -\frac{51}{50}$, $t = 0$. It will
be found on substitution that these values satisfy equations (A).

Although Cramer’s rule is simple and easy to apply, its use requires a 
great deal of labor when the number of equations exceeds four or five, 
because of the labor in evaluating the determinants involved. 
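Cramer's rule as stated above can be sketched in a few lines. The names `det` and `cramer` are illustrative assumptions; the determinants are evaluated here by expansion in minors, which is adequate only for small systems, in keeping with the remark on labor just made.

```python
from fractions import Fraction


def det(m):
    """Determinant by expansion along the first row (small orders only)."""
    if len(m) == 1:
        return m[0][0]
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))


def cramer(a, k):
    """Solve a*x = k: each unknown is a ratio of two determinants."""
    d = det(a)
    if d == 0:
        raise ValueError("determinant of the coefficients is zero")
    solution = []
    for j in range(len(k)):
        # Replace column j of the coefficients by the constant terms.
        aj = [row[:j] + [k[i]] + row[j + 1:] for i, row in enumerate(a)]
        solution.append(Fraction(det(aj), d))
    return solution


if __name__ == "__main__":
    a = [[3, 2, -1, 1],
         [1, -1, -2, 4],
         [2, 3, 1, -2],
         [5, -2, 3, 2]]
    k = [1, 3, -2, 0]
    print(det(a), cramer(a, k))
```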


II. SOLUTION BY SUCCESSIVE ELIMINATION OF THE UNKNOWNS

Several methods have been devised for solving systems of linear equa- 
tions by successive or step-by-step elimination of the unknowns. Three 
of the most important of these methods will be explained in the following 
pages. 

94. The Method of Division by the Leading Coefficients. The simplest
of the step-by-step elimination methods is that in which each equation is
first divided throughout by its leading coefficient, the equations being
thereby transformed into a second set in which all leading coefficients
are unity. To complete the first step, one of the transformed equations,
which will be called the pivotal equation, is subtracted from each of the
others (if all leading coefficients are positive), thus eliminating one unknown
from the set. If the original set contained n unknowns, the first
step reduces it to a set in n - 1 unknowns.

If the same procedure is applied to the new system in n - 1 unknowns,
we get a third set of equations in n - 2 unknowns; and by continuing the
process we finally arrive at a single equation in one unknown. When the
unknown has been found from this last equation, it is substituted into the
preceding pivotal equation. The method of back substitution is continued
until all the unknowns have been found, the back substitutions always
being made into the immediately preceding pivotal equations. The following
example should make the method clear.





Example. Solve the equations

$$\begin{aligned} 475x - 316y - 407z + 253t &= 521 \\ 296x - 482y - 395z + 242t &= 720 \\ 364x - 421y - 643z + 342t &= 634 \\ 282x - 286y - 315z + 448t &= 266. \end{aligned}$$

Solution. On dividing each equation throughout by its leading coefficient,
we get

$$\begin{aligned} (a_1) \quad & x - 0.66527y - 0.85684z + 0.53264t = 1.09685 \\ (a_2) \quad & x - 1.6284y - 1.3345z + 0.81758t = 2.4324 \\ (a_3) \quad & x - 1.1566y - 1.7665z + 0.93958t = 1.7418 \\ (a_4) \quad & x - 1.0142y - 1.1170z + 1.5887t = 0.94326. \end{aligned}$$

Now taking the first of these equations as pivotal equation and subtracting
it from each of the others, we get the new set

$$\begin{aligned} -0.9631y - 0.4777z + 0.28494t &= 1.3356 \\ -0.4913y - 0.9097z + 0.40694t &= 0.6450 \\ -0.3489y - 0.2602z + 1.0561t &= -0.15359. \end{aligned}$$

This completes the first step. Succeeding steps are carried out in exactly 
the same manner. 

In practice, the solution is exhibited in compact form as shown below,
the unknowns being written at the top of the table and only the coefficients
shown in the work. The constant terms are transposed to the left members
and therefore appear in column c with their signs changed. Since errors are
liable to be made in the computation, checks are provided in each line of
the table. The column headed "Sum" gives the algebraic sum of the
coefficients and constant term in each equation. The check numbers were
obtained by performing on the previous sums the same operations (divisions
and subtractions) as were performed on the corresponding previous equations.
For example, the check number 2.5869 in row $(b_1)$ was obtained by dividing
-2.4915 by -0.9631; and the check number 0.7492 in row $(b_2) - (b_1)$ came
from subtracting 2.5869 from 3.3361. The truth of the checks is evident from
the axioms that if equals are divided by equals the quotients are equal, and
if equals are subtracted from equals the remainders are equal. Hence the sums
should agree with the check numbers in the same row.






Row          x     y          z          t          c          Sum        Check

             475   -316       -407       253        -521       -516
             296   -482       -395       242        -720       -1059
             364   -421       -643       342        -634       -992
             282   -286       -315       448        -266       -137

(a1)         1     -0.66527   -0.85684   0.53264    -1.09685   -1.08632   -1.0863
(a2)         1     -1.6284    -1.3345    0.81758    -2.4324    -3.5777    -3.5778
(a3)         1     -1.1566    -1.7665    0.93958    -1.7418    -2.7253    -2.7252
(a4)         1     -1.0142    -1.1170    1.5887     -0.94326   -0.4858    -0.4858

(a2)-(a1)          -0.9631    -0.4777    0.28494    -1.3356    -2.4915    -2.4914
(a3)-(a1)          -0.4913    -0.9097    0.40694    -0.6450    -1.6391    -1.6390
(a4)-(a1)          -0.3489    -0.2602    1.0561     0.15359    0.6006     0.6005

(b1)               1          0.49601    -0.29586   1.3868     2.5869     2.5869
(b2)               1          1.8516     -0.82828   1.31284    3.3361     3.3362
(b3)               1          0.74578    -3.0269    -0.44021   -1.7213    -1.7214

(b2)-(b1)                     1.3556     -0.53242   -0.0740    0.7492     0.7492
(b3)-(b1)                     0.24977    -2.7310    -1.8270    -4.3082    -4.3082

(c1)                          1          -0.39276   -0.05459   0.55265    0.55267
(c2)                          1          -10.934    -7.3148    -17.249    -17.249

(c2)-(c1)                                -10.541    -7.2602    -17.801    -17.802


Having reduced the given system to the single equation

$$-10.541t - 7.2602 = 0,$$

we find

$$t = -0.68877.$$

Now substituting this value of t into $(c_1)$, we have

$$z - 0.39276(-0.68877) - 0.05459 = 0,$$

whence

$$z = -0.21593.$$

On substituting into $(b_1)$ these values of t and z, we find

$$y = -1.4835.$$

Then these known values are substituted into $(a_1)$ to find

$$x = 0.29177.$$

As a final check we substitute the above values of x, y, z, and t into the
original equations and find that the left members become 520.99, 720.01,
634.02, and 266.02, respectively. The discrepancies are thus only -0.01,
0.01, 0.02, and 0.02, respectively.
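The procedure of this article (divide every equation by its leading coefficient, subtract the pivotal equation from the rest, repeat, then substitute back) can be sketched as below. The function name `solve_by_leading_division` is an assumption made for illustration, the first equation of each set is always taken as the pivotal one, and the sketch simply stops if a leading coefficient happens to be zero. It works in ordinary floating point rather than with the five-figure rounding of the hand computation, so its results agree with the table only to about the accuracy of that rounding.

```python
def solve_by_leading_division(a, k):
    """Solve a*x = k by repeated division by leading coefficients."""
    n = len(k)
    # Augmented rows: coefficients followed by the constant term.
    rows = [[float(v) for v in row] + [float(c)] for row, c in zip(a, k)]
    pivotal = []
    for col in range(n):
        normalized = []
        for row in rows:
            lead = row[col]
            if lead == 0.0:
                raise ZeroDivisionError("zero leading coefficient")
            normalized.append([v / lead for v in row])
        pivot = normalized[0]          # pivotal equation of this step
        pivotal.append(pivot)
        # Subtracting the pivotal equation removes the unknown in this column.
        rows = [[v - p for v, p in zip(row, pivot)] for row in normalized[1:]]
    # Back substitution into the pivotal equations, last one first.
    x = [0.0] * n
    for col in range(n - 1, -1, -1):
        p = pivotal[col]
        x[col] = p[n] - sum(p[j] * x[j] for j in range(col + 1, n))
    return x


if __name__ == "__main__":
    a = [[475, -316, -407, 253],
         [296, -482, -395, 242],
         [364, -421, -643, 342],
         [282, -286, -315, 448]]
    k = [521, 720, 634, 266]
    print(solve_by_leading_division(a, k))
```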





95. The Method of Gauss. In the Gauss method of solving simultaneous
linear equations, as explained in detail and with great clarity by
Encke,* the unknowns are eliminated successively by solving some equation
for one unknown in terms of all the others, then substituting this value
for the same unknown in all the remaining equations, thereby eliminating
the unknown from the set. The process is repeated on the new set of
equations, thus eliminating another unknown; and so on until the system
is reduced to a single equation in one unknown.

The equations which express one unknown explicitly in terms of all the
others are called pivotal equations. After one unknown has been found,
the remaining unknowns are found by back substitution into the pivotal
equations.

In the solution of numerical equations by Gauss's method, one should
always select as pivotal equation the equation which has the numerically
largest coefficient of the unknown it is desired to eliminate. For example,
if the unknowns are to be eliminated in the order x, y, z (or $x_1$, $x_2$, $x_3$), the first
pivotal equation is obtained by solving for x that equation which has the
largest x-coefficient. Then the second pivotal equation is obtained by
solving for y that equation (in the new set) which has the largest y-coefficient,
etc. A slightly better result may be obtained by disregarding the
order of elimination and solving, at each step in the elimination process,
the equation in which the largest coefficient in the entire set occurs, this
equation being solved for the unknown to which the largest coefficient is
attached. The reason for preferring the largest coefficients will appear
later. The following example illustrates the Gauss method.

Example. Solve the following set of equations by Gauss's method,
assuming that the coefficients and known terms are exact numbers:

$$\begin{aligned} (1) \quad & 2.63x + 5.21y - 1.694z + 0.938t - 4.23 = 0 \\ (2) \quad & 3.16x - 2.95y + 0.813z - 4.21t + 0.716 = 0 \\ (3) \quad & 5.36x + 1.88y - 2.15z - 4.95t - 1.28 = 0 \\ (4) \quad & 1.34x + 2.98y - 0.432z - 1.768t - 0.419 = 0. \end{aligned}$$

Solution. The variables will be eliminated in the order x, y, z, t. Since
the largest coefficient of x occurs in (3), we take that as the pivotal equation.
Solving it for x, we get

$$x = -\frac{1.88}{5.36}y + \frac{2.15}{5.36}z + \frac{4.95}{5.36}t + \frac{1.28}{5.36} = -0.350746y + 0.401119z + 0.923506t + 0.238806. \tag{5}$$


* Berliner Astronomisches Jahrbuch, 1835, pp. 267-272; 1836, pp. 256-258.




Substituting this value of x into (1), (2), and (4), and reducing, we get

$$\begin{aligned} (6) \quad & 4.287538y - 0.63906z + 3.36682t - 3.601940 = 0 \\ (7) \quad & -4.05836y + 2.08054z - 1.29172t + 1.470627 = 0 \\ (8) \quad & 2.510000y + 0.105500z - 0.530500t - 0.099000 = 0. \end{aligned}$$

Since (6) has the largest y-coefficient, we solve it for y and get

$$y = 0.149051z - 0.785258t + 0.840096. \tag{9}$$

Substituting this value of y into (7) and (8), we get

$$\begin{aligned} (10) \quad & 1.47564z + 1.89514t - 1.93879 = 0 \\ (11) \quad & 0.479618z - 2.501500t + 2.00964 = 0. \end{aligned}$$

Here we take (10) as the pivotal equation and get

$$z = -1.28428t + 1.31386. \tag{12}$$

Substituting this value of z into (11), we get the final equation

$$-3.117463t + 2.63979 = 0, \tag{13}$$

from which t = 0.846775.

The values of z, y, and x are found by back substitution into (12), (9),
and (5), respectively. Substituting the value of t into (12), we find

z = 0.22636.

Then substituting the values of t and z into (9), we find

y = 0.208898.

Finally, on substituting the values of t, z, and y into (5), we get

x = 1.038335.

The solution of the given system of equations is thus

x = 1.038335
y = 0.208898
z = 0.22636
t = 0.846775.

When these values are substituted into the original equations (1), (2),
(3), (4), the left members of those equations have the values 0.00000,
0.00000, 0.00001, 0.00000, respectively.




It will be observed that the Gauss method reduced the original system
of equations to the triangular system of pivotal equations:

$$\begin{aligned} 5.36x + 1.88y - 2.15z - 4.95t - 1.28 &= 0 \\ 4.287538y - 0.63906z + 3.36682t - 3.601940 &= 0 \\ 1.47564z + 1.89514t - 1.93879 &= 0 \\ -3.117463t + 2.63979 &= 0. \end{aligned}$$

Since the value of the determinant of the coefficients in the given system
is equal to the product of the leading coefficients in the triangular system
(Art. 92), we have

$$\Delta = (5.36)(4.287538)(1.47564)(-3.117463) = -105.720.$$
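The elimination just carried out, with the largest available coefficient chosen at each step, can be sketched as follows. This is only an illustration: the name `gauss_solve` is an assumption, the equations are supplied in the form "coefficients times unknowns equals constant" (so the constant terms of the example enter with their signs changed), and the determinant is accumulated as the product of the pivots with a sign change for every row interchange.

```python
def gauss_solve(a, k):
    """Solve a*x = k by elimination with largest-coefficient pivoting.

    Returns the solution and the determinant of the coefficients.
    """
    n = len(k)
    m = [[float(v) for v in row] + [float(c)] for row, c in zip(a, k)]
    det = 1.0
    for col in range(n):
        # Pivotal equation: numerically largest coefficient of this unknown.
        p = max(range(col, n), key=lambda r: abs(m[r][col]))
        if m[p][col] == 0.0:
            raise ValueError("the determinant of the coefficients is zero")
        if p != col:
            m[col], m[p] = m[p], m[col]
            det = -det
        det *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for j in range(col, n + 1):
                m[r][j] -= factor * m[col][j]
    # Back substitution into the triangular system.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = s / m[i][i]
    return x, det


if __name__ == "__main__":
    a = [[2.63, 5.21, -1.694, 0.938],
         [3.16, -2.95, 0.813, -4.21],
         [5.36, 1.88, -2.15, -4.95],
         [1.34, 2.98, -0.432, -1.768]]
    k = [4.23, -0.716, 1.28, 0.419]
    print(gauss_solve(a, k))
```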


96. Another Version of the Gauss Method. If we apply the method
of the previous article to the equations

$$\begin{aligned} a_1x + a_2y + a_3z &= k_1 \quad \text{(a)} \\ b_1x + b_2y + b_3z &= k_2 \quad \text{(b)} \\ c_1x + c_2y + c_3z &= k_3 \quad \text{(c)} \end{aligned} \tag{1}$$

and assume that $a_1$ is larger than either $b_1$ or $c_1$, we take (a) as the pivotal
equation and solve it for x, obtaining

$$x = \frac{k_1}{a_1} - \frac{a_2}{a_1}y - \frac{a_3}{a_1}z.$$

On substituting this into (b) and (c), we get

$$\begin{aligned} \left(b_2 - \frac{b_1}{a_1}a_2\right)y + \left(b_3 - \frac{b_1}{a_1}a_3\right)z &= k_2 - \frac{b_1}{a_1}k_1 \quad \text{(d)} \\ \left(c_2 - \frac{c_1}{a_1}a_2\right)y + \left(c_3 - \frac{c_1}{a_1}a_3\right)z &= k_3 - \frac{c_1}{a_1}k_1 \quad \text{(e)} \end{aligned} \tag{2}$$

Equations (2) are exactly what would have been obtained if we had
multiplied (1) (a) successively by $\dfrac{b_1}{a_1}$ and $\dfrac{c_1}{a_1}$ and then subtracted the
resulting equations from (1) (b) and (1) (c), respectively. Equations (2)
will also be obtained if we first divide (1) (a) throughout by $a_1$, then
multiply the resulting equation successively by $b_1$ and $c_1$, and then subtract
these resulting equations from (1) (b) and (1) (c), respectively. The
Gauss method is therefore equivalent to either of the following procedures:




1. Choose the pivotal equation just as in Art. 95. Multiply this pivotal
equation successively by such positive numbers as will make its leading 
coefficient in each case numerically equal to the leading coefficients of the 
other equations of the set. Then subtract the multiplied pivotal equations 
from the other equations having the same leading coefficients, thereby 
eliminating one unknown completely. Follow the same procedure with 
the new equations in n — 1 unknowns. Or: 

2. Divide the pivotal equation throughout by its leading coefficient. 
Then multiply the resulting equation successively by the leading coefficients 
of the other equations and subtract the multiplied pivotal equations from 
the other equations having the same leading coefficients. Follow the same 
procedure with the new set in n — 1 unknowns. 

In case the leading coefficients in some of the other equations are negative 
(or have signs opposite that of the leading coefficient of the pivotal equa- 
tion), the multiplied pivotal equations are added to the other equations 
instead of subtracted from them. 

Example. Solve the equations

$$\begin{aligned} 2.63x + 5.21y - 1.694z + 0.938t - 4.23 &= 0 \quad \text{(a)} \\ 3.16x - 2.95y + 0.813z - 4.21t + 0.716 &= 0 \quad \text{(b)} \\ 5.36x + 1.88y - 2.15z - 4.95t - 1.28 &= 0 \quad \text{(c)} \\ 1.34x + 2.98y - 0.432z - 1.768t - 0.419 &= 0 \quad \text{(d)} \end{aligned} \tag{A}$$

by the first method stated above.

Solution. We take (c) as the pivotal equation and multiply it successively
by $\dfrac{2.63}{5.36}$, $\dfrac{3.16}{5.36}$, and $\dfrac{1.34}{5.36}$, thereby obtaining the equations

$$\begin{aligned} 2.63x + 0.92246y - 1.05494z - 2.42882t - 0.62806 &= 0 \\ 3.16x + 1.10836y - 1.26754z - 2.91828t - 0.754627 &= 0 \\ 1.34x + 0.47000y - 0.53750z - 1.23750t - 0.32000 &= 0. \end{aligned} \tag{B}$$

Now subtracting the first of these equations from (a), the second from (b),
and the third from (d), we get

$$\begin{aligned} 4.28754y - 0.63906z + 3.36682t - 3.60194 &= 0 \\ -4.05836y + 2.08054z - 1.29172t + 1.47063 &= 0 \\ 2.51000y + 0.10550z - 0.53050t - 0.09900 &= 0. \end{aligned} \tag{C}$$

These equations are the same as (6), (7), (8) of Art. 95.




The computation is usually presented in tabular form as given below:

x       y          z          t          k          Sum        Check

2.63    5.21       -1.694     0.938      -4.23      2.854
3.16    -2.95      0.813      -4.21      0.716      -2.471
5.36    1.88       -2.15      -4.95      -1.28      -1.14
1.34    2.98       -0.432     -1.768     -0.419     1.701

        4.28754    -0.63906   3.36682    -3.60194   3.41336    3.41337
        -4.05836   2.08054    -1.29172   1.47063    -1.79891   -1.79891
        2.51000    0.10550    -0.53050   -0.09900   1.98600    1.98600

                   1.47564    1.89514    -1.93879   1.43199    1.43200
                   0.47962    -2.50150   2.00964    -0.01224   -0.01224

                              -3.11747   2.63979    -0.47768   -0.47767


From the last equation we find

$$t = \frac{2.63979}{3.11747} = 0.846773.$$

Back substitution into the other pivotal equations gives

z = 0.22637
y = 0.208901
x = 1.03834.

When these values are substituted into the given equations, the left members
of those equations become 0.00001, 0.00000, 0.00000, and 0.00001, respectively.
The slight discrepancies between the results found above and those
found in Art. 95 are due to the fact that some of the numbers in the
above table were rounded off to the same number of decimal places as
some other numbers to which they had to be added to obtain the "sums."

The numbers in the check column were obtained from the "sums" found
in the preceding sets of equations, by applying to those sums the same
operations as were applied to the pivotal equations. For example, the
check numbers 1.43200 and -0.01224 were obtained as follows:

$$-1.79891 + \frac{4.05836}{4.28754} \times 3.41336 = 1.43200$$

$$1.98600 - \frac{2.51000}{4.28754} \times 3.41336 = -0.01224.$$

In case a check number fails to agree with the sum immediately to the
left of it, a mistake has been made in the computation and should be
found and corrected at once.
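The "Sum" and "Check" columns can be imitated in a program by carrying one extra entry along with every equation. The sketch below is illustrative only: the name `eliminate_with_checks` is an assumption, each row holds the coefficients followed by the constant term (equation set equal to zero, as in the table), and the carried check is compared with a freshly formed sum after every elimination step.

```python
def eliminate_with_checks(rows):
    """Triangularize the system while carrying a sum check in each row.

    Returns the reduced rows (check entry last) and the largest
    discrepancy found between a carried check and a fresh row sum.
    """
    # Append the initial row sum; it is then treated like any other column.
    work = [[float(v) for v in row] + [float(sum(row))] for row in rows]
    n = len(work)
    worst = 0.0
    for col in range(n - 1):
        p = max(range(col, n), key=lambda r: abs(work[r][col]))
        work[col], work[p] = work[p], work[col]
        for r in range(col + 1, n):
            factor = work[r][col] / work[col][col]
            work[r] = [v - factor * pv for v, pv in zip(work[r], work[col])]
            fresh_sum = sum(work[r][:-1])          # the "Sum" column
            worst = max(worst, abs(fresh_sum - work[r][-1]))  # vs. "Check"
    return work, worst


if __name__ == "__main__":
    rows = [[2.63, 5.21, -1.694, 0.938, -4.23],
            [3.16, -2.95, 0.813, -4.21, 0.716],
            [5.36, 1.88, -2.15, -4.95, -1.28],
            [1.34, 2.98, -0.432, -1.768, -0.419]]
    reduced, worst = eliminate_with_checks(rows)
    print(reduced[-1], worst)
```

In floating point the two columns differ only by rounding; a discrepancy much larger than that would signal a mistake in the elimination, which is the purpose the check column serves in hand computation.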








If we wished to solve the given set of equations by the second method
outlined above, we would first divide equation (c) throughout by 5.36,
thereby obtaining the pivotal equation,

x + 0.350746y - 0.401119z - 0.923507t - 0.238806 = 0.

Then we would multiply this equation successively by 2.63, 3.16, and 1.34.
The resulting equations would then be subtracted from (a), (b), and (d),
respectively, thus obtaining equations which should agree with equations
(C). The solution would be continued by dividing the first of the new
equations throughout by 4.28754 to get a new pivotal equation, etc. After
finding t, we would find the other unknowns by back substitution into the
pivotal equations. As a final step, we would substitute the computed values
of x, y, z, and t into equations (A) as a check.

We may now state the reason for choosing as pivotal equations those
equations having the largest coefficients of the unknowns we desire to
eliminate. A glance at equations (2) shows that fractional terms are
present in the coefficients and in the constant terms. It is desirable that
such fractional terms be as small as possible, and this necessitates that the
denominator $a_1$ be as large as possible. Moreover, if the numerators of
such fractions contain rounding errors, the effects of such errors are
diminished when the denominator $a_1$ is large.
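The effect of the size of the denominator can be seen in a small experiment. The sketch below is not from the text: it solves a hypothetical pair of equations, 0.0001x + y = 1 and x + y = 2, while rounding every intermediate result to three significant figures (the helper `r3` imitates a short hand computation). The true solution is close to x = 1, y = 1.

```python
def r3(v):
    """Round v to three significant figures."""
    return float(f"{v:.2e}")


def solve_2x2_rounded(eq1, eq2):
    """Solve a1*x + a2*y = k1 (eq1) and b1*x + b2*y = k2 (eq2).

    eq1 is used as the pivotal equation; every intermediate result is
    rounded with r3.
    """
    a1, a2, k1 = eq1
    b1, b2, k2 = eq2
    m = r3(b1 / a1)                     # fractional multiplier b1/a1
    y_coeff = r3(b2 - r3(m * a2))       # b2 - (b1/a1)*a2
    rhs = r3(k2 - r3(m * k1))           # k2 - (b1/a1)*k1
    y = r3(rhs / y_coeff)
    x = r3(r3(k1 - r3(a2 * y)) / a1)    # back substitution into eq1
    return x, y


if __name__ == "__main__":
    small_pivot = solve_2x2_rounded((0.0001, 1, 1), (1, 1, 2))
    large_pivot = solve_2x2_rounded((1, 1, 2), (0.0001, 1, 1))
    print(small_pivot)  # pivotal equation with the small x-coefficient
    print(large_pivot)  # pivotal equation with the large x-coefficient
```

With the small denominator the value of x is lost entirely to rounding, while with the large denominator both unknowns come out correct to the three figures carried.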


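III. SOLUTION BY INVERSION OF MATRICES

97. Definitions. A matrix is a rectangular array of quantities or
numbers, such as

$$\begin{matrix} a_1 & a_2 & a_3 & a_4 \\ b_1 & b_2 & b_3 & b_4 \\ c_1 & c_2 & c_3 & c_4 \end{matrix}$$

To distinguish such an array from a determinant, which it resembles in
appearance, it is always enclosed by square brackets, large parentheses, or
double bars, as:

$$\begin{bmatrix} a_1 & a_2 & a_3 & a_4 \\ b_1 & b_2 & b_3 & b_4 \\ c_1 & c_2 & c_3 & c_4 \end{bmatrix}, \quad \begin{pmatrix} a_1 & a_2 & a_3 & a_4 \\ b_1 & b_2 & b_3 & b_4 \\ c_1 & c_2 & c_3 & c_4 \end{pmatrix}, \quad \text{or} \quad \begin{Vmatrix} a_1 & a_2 & a_3 & a_4 \\ b_1 & b_2 & b_3 & b_4 \\ c_1 & c_2 & c_3 & c_4 \end{Vmatrix}.$$

We shall use the square-bracket notation and write a general matrix in
the form

$$\begin{bmatrix} a_{11} & a_{12} & a_{13} & \cdots & a_{1n} \\ a_{21} & a_{22} & a_{23} & \cdots & a_{2n} \\ \vdots & & & & \vdots \\ a_{m1} & a_{m2} & a_{m3} & \cdots & a_{mn} \end{bmatrix}.$$

The quantities $a_{11}$, $a_{12}$, etc., are called the elements of the matrix, as in
the case of determinants. It is to be noted that the first digit in the
double subscript of an element denotes the row and the second digit
denotes the column in which the element stands. A matrix of m rows
and n columns is an m × n matrix.

If m = n, the matrix is a square matrix of order n.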

A matrix may consist of only a single column, as

$$\begin{bmatrix} a_1 \\ a_2 \\ \vdots \end{bmatrix},$$

in which case it is called a column matrix. It is merely a special case of
a general matrix.

If all the elements in the leading diagonal of a square matrix are unity 
and all the other elements are zeros, the matrix is called a unit matrix. 
Thus, 

$$\begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix}$$

is a unit matrix of the fourth order. A unit matrix of any order will be 
denoted by the symbol I. Unit matrices play an important role in the 
application of matrices. 

If all the elements of a matrix are zero, the matrix itself is zero. 
Although the elements of a matrix are numbers, the matrix itself is not 
a number. It plays the role of an operator, as we shall see later. 

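98. Addition and Subtraction of Matrices. Two matrices of the
same order can be added or subtracted by adding or subtracting their
corresponding elements. Thus, the sum of the two matrices

\[
A = \begin{bmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{bmatrix}
\qquad\text{and}\qquad
B = \begin{bmatrix} b_{11} & b_{12} & b_{13} \\ b_{21} & b_{22} & b_{23} \\ b_{31} & b_{32} & b_{33} \end{bmatrix}
\]

is the matrix

\[
C = \begin{bmatrix}
a_{11}+b_{11} & a_{12}+b_{12} & a_{13}+b_{13} \\
a_{21}+b_{21} & a_{22}+b_{22} & a_{23}+b_{23} \\
a_{31}+b_{31} & a_{32}+b_{32} & a_{33}+b_{33}
\end{bmatrix}.
\]

We therefore write

A + B = C.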

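The difference of two matrices is found in the same manner, and we
therefore write

A - B = C',

where the elements of C' are those of C with the signs of the b's changed.
Examples:

\[
\begin{bmatrix} 2 & 3 & -4 \\ 5 & -1 & 0 \end{bmatrix}
+ \begin{bmatrix} 1 & -2 & 4 \\ 3 & 5 & 2 \end{bmatrix}
= \begin{bmatrix} 3 & 1 & 0 \\ 8 & 4 & 2 \end{bmatrix},
\]

\[
\begin{bmatrix} 3 & -2 & 5 \\ 6 & 0 & 3 \\ 1 & 5 & 4 \end{bmatrix}
- \begin{bmatrix} 2 & 3 & -1 \\ 4 & 1 & 0 \\ -3 & 2 & -1 \end{bmatrix}
= \begin{bmatrix} 1 & -5 & 6 \\ 2 & -1 & 3 \\ 4 & 3 & 5 \end{bmatrix}.
\]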
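The element-by-element rule of this article can be sketched in a few lines of Python. This is only an illustration under stated assumptions: matrices are represented as lists of rows, and the names `mat_add` and `mat_sub` are hypothetical helpers, not part of the text.

```python
def mat_add(a, b):
    """Sum of two matrices of the same order, element by element."""
    same_shape = len(a) == len(b) and all(len(ra) == len(rb) for ra, rb in zip(a, b))
    if not same_shape:
        raise ValueError("matrices must have the same number of rows and columns")
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def mat_sub(a, b):
    """Difference a - b: add a to b with the signs of b's elements changed."""
    return mat_add(a, [[-y for y in rb] for rb in b])


# First example of this article (a 2 X 3 sum).
total = mat_add([[2, 3, -4], [5, -1, 0]], [[1, -2, 4], [3, 5, 2]])
```

The shape check mirrors the requirement that only matrices of the same order can be combined in this way.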


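99. Multiplication of Matrices. a) Multiplication of a matrix by a
simple number or scalar. To multiply a matrix by a number or scalar
quantity, we multiply every element of the matrix by that number. For
example,

\[
m \begin{bmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \end{bmatrix}
= \begin{bmatrix} ma_1 & ma_2 & ma_3 \\ mb_1 & mb_2 & mb_3 \end{bmatrix}.
\]

To see the reason for this rule of multiplication, let us consider the sum
of three matrices of the second order,

\[
A + B + C =
\begin{bmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{bmatrix}
+ \begin{bmatrix} b_{11} & b_{12} \\ b_{21} & b_{22} \end{bmatrix}
+ \begin{bmatrix} c_{11} & c_{12} \\ c_{21} & c_{22} \end{bmatrix}.
\]

Let us suppose now that the three matrices become identical, so that
B = C = A and $b_{rs} = c_{rs} = a_{rs}$. Then we have

\[
A + A + A = 3A = \begin{bmatrix} 3a_{11} & 3a_{12} \\ 3a_{21} & 3a_{22} \end{bmatrix},
\]

and so in general.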

Note that this multiplication differs from the multiplication of a
determinant by a scalar, for in the latter case the elements of only one row or
column are multiplied by the scalar.

b) Multiplication of a matrix by another matrix. In the system of
equations

\[
(1)\qquad
\begin{cases}
a_1x_1 + a_2x_2 + a_3x_3 = k_1 \\
b_1x_1 + b_2x_2 + b_3x_3 = k_2 \\
c_1x_1 + c_2x_2 + c_3x_3 = k_3
\end{cases}
\]

the array

\[
\begin{bmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{bmatrix}
\]

is called the matrix of the coefficients. This matrix may be regarded as
an operator which operates on the x's to produce the k's on the right side
of the equations. The operation is seen to be a type of multiplication.
If the x's be arranged in a vertical column, as

\[
\begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix},
\]

the left member of the first equation of (1) is seen to be the sum of the
products of the elements of the top row of the coefficient matrix by the
corresponding x's in the vertical column. The left members of the other
equations can be obtained in the same manner. We are therefore justified
in writing (1) in the form

\[
(2)\qquad
\begin{bmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{bmatrix}
\begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix}
= \begin{bmatrix} k_1 \\ k_2 \\ k_3 \end{bmatrix}.
\]

With (2) as a starting point we utilize a linear transformation to derive
the rule for the multiplication of matrices in general.

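Consider the two systems of equations

\[
(3)\qquad
\begin{cases}
y_1 = a_{11}x_1 + a_{12}x_2 + a_{13}x_3 \\
y_2 = a_{21}x_1 + a_{22}x_2 + a_{23}x_3
\end{cases}
\qquad\text{or}\qquad
\begin{bmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \end{bmatrix}
\begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix}
= \begin{bmatrix} y_1 \\ y_2 \end{bmatrix}
\]

and

\[
(4)\qquad
\begin{cases}
z_1 = b_{11}y_1 + b_{12}y_2 \\
z_2 = b_{21}y_1 + b_{22}y_2 \\
z_3 = b_{31}y_1 + b_{32}y_2
\end{cases}
\qquad\text{or}\qquad
\begin{bmatrix} b_{11} & b_{12} \\ b_{21} & b_{22} \\ b_{31} & b_{32} \end{bmatrix}
\begin{bmatrix} y_1 \\ y_2 \end{bmatrix}
= \begin{bmatrix} z_1 \\ z_2 \\ z_3 \end{bmatrix}.
\]

Eliminating $y_1$ and $y_2$ by substituting in (4) their values as given in (3),
we get

\[
(5)\qquad
\begin{cases}
z_1 = (b_{11}a_{11} + b_{12}a_{21})x_1 + (b_{11}a_{12} + b_{12}a_{22})x_2 + (b_{11}a_{13} + b_{12}a_{23})x_3 \\
z_2 = (b_{21}a_{11} + b_{22}a_{21})x_1 + (b_{21}a_{12} + b_{22}a_{22})x_2 + (b_{21}a_{13} + b_{22}a_{23})x_3 \\
z_3 = (b_{31}a_{11} + b_{32}a_{21})x_1 + (b_{31}a_{12} + b_{32}a_{22})x_2 + (b_{31}a_{13} + b_{32}a_{23})x_3,
\end{cases}
\]

or, in matrix form,

\[
(6)\qquad
\begin{bmatrix}
b_{11}a_{11}+b_{12}a_{21} & b_{11}a_{12}+b_{12}a_{22} & b_{11}a_{13}+b_{12}a_{23} \\
b_{21}a_{11}+b_{22}a_{21} & b_{21}a_{12}+b_{22}a_{22} & b_{21}a_{13}+b_{22}a_{23} \\
b_{31}a_{11}+b_{32}a_{21} & b_{31}a_{12}+b_{32}a_{22} & b_{31}a_{13}+b_{32}a_{23}
\end{bmatrix}
\begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix}
= \begin{bmatrix} z_1 \\ z_2 \\ z_3 \end{bmatrix}.
\]

Now replacing the column matrix $\begin{bmatrix} y_1 \\ y_2 \end{bmatrix}$ of (4) by its value as given in
(3), we get

\[
(7)\qquad
\begin{bmatrix} b_{11} & b_{12} \\ b_{21} & b_{22} \\ b_{31} & b_{32} \end{bmatrix}
\begin{bmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \end{bmatrix}
\begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix}
= \begin{bmatrix} z_1 \\ z_2 \\ z_3 \end{bmatrix}.
\]

Comparison of the left members of (6) and (7) gives the relation

\[
(8)\qquad
\begin{bmatrix} b_{11} & b_{12} \\ b_{21} & b_{22} \\ b_{31} & b_{32} \end{bmatrix}
\begin{bmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \end{bmatrix}
=
\begin{bmatrix}
b_{11}a_{11}+b_{12}a_{21} & b_{11}a_{12}+b_{12}a_{22} & b_{11}a_{13}+b_{12}a_{23} \\
b_{21}a_{11}+b_{22}a_{21} & b_{21}a_{12}+b_{22}a_{22} & b_{21}a_{13}+b_{22}a_{23} \\
b_{31}a_{11}+b_{32}a_{21} & b_{31}a_{12}+b_{32}a_{22} & b_{31}a_{13}+b_{32}a_{23}
\end{bmatrix}.
\]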


Formula (8) expresses the rule for the multiplication of matrices. If 
the first of the two matrices in the left member of (8) be denoted by B 
and the second by A, a glance at (8) shows that the elements in the 
product matrix in the right member can be obtained by following the 
procedure outlined below. It is best to compute the product by columns ; 
that is, compute one column at a time, beginning with the first. 

First column of product: 

To find the first element in the first column of the product, multiply the 
elements in the first row of B into the corresponding elements in the first 
column of A and sum the products thus obtained. To find the second 
element in the first column of the product, multiply the elements in the
second row of B into the corresponding elements in the first column of A
and sum the products thus obtained. The third element in the first column
of the product is found by multiplying the elements in the third row of B
into the corresponding elements in the first column of A, and so on for
the remaining elements in the first column of the product.

Second column of product: 

To find the first element in the second column of the product matrix, 
multiply the elements in the first row of B into the corresponding elements 
in the second column of A and sum the products thus obtained. To find 
the second element in the second column of the product, multiply the 
elements in the second row of B into the corresponding elements in the 
second column of A and sum the products thus obtained. The remaining 
elements in the second column of the product are found in a similar
manner.


Third column of product: 

Proceed as for first and second columns, except that the elements in 
the rows of B must be multiplied into the corresponding elements in the 
third column of A. 
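The column-by-column procedure just described can be sketched as follows. This is a minimal illustration, assuming matrices stored as lists of rows; the function name `mat_mul` and its argument order (first factor B, second factor A) are assumptions made here to match the wording of the rule. The two matrices are the ones multiplied in the example of this article.

```python
def mat_mul(b, a):
    """Product BA, computed one column of the product at a time."""
    if any(len(row) != len(a) for row in b):
        raise ValueError("columns of the first factor must equal rows of the second")
    n_rows, n_cols = len(b), len(a[0])
    product = [[0] * n_cols for _ in range(n_rows)]
    for j in range(n_cols):                    # j-th column of the product
        column = [a_row[j] for a_row in a]     # j-th column of A
        for i in range(n_rows):                # i-th row of B into that column
            product[i][j] = sum(x * y for x, y in zip(b[i], column))
    return product


B = [[3, 1, 2], [-1, 2, 3], [2, -5, 7]]
A = [[-2, 4, 1], [3, -1, 2], [4, 1, 3]]
C = mat_mul(B, A)
```

Computing by columns is only a bookkeeping choice; filling the product row by row gives the same elements.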


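Example.

\[
\begin{bmatrix} 3 & 1 & 2 \\ -1 & 2 & 3 \\ 2 & -5 & 7 \end{bmatrix}
\times
\begin{bmatrix} -2 & 4 & 1 \\ 3 & -1 & 2 \\ 4 & 1 & 3 \end{bmatrix}
\]

\[
= \begin{bmatrix}
3(-2) + 1(3) + 2(4) & 3(4) + 1(-1) + 2(1) & 3(1) + 1(2) + 2(3) \\
(-1)(-2) + 2(3) + 3(4) & (-1)4 + 2(-1) + 3(1) & (-1)1 + 2(2) + 3(3) \\
2(-2) + (-5)3 + 7(4) & 2(4) + (-5)(-1) + 7(1) & 2(1) + (-5)2 + 7(3)
\end{bmatrix}
= \begin{bmatrix} 5 & 13 & 11 \\ 20 & -3 & 12 \\ 9 & 20 & 13 \end{bmatrix}.
\]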


Note that in matrix multiplication: 

(a) Rows are always multiplied into columns. 

(b) The number of rows in the product is the same as the number of 
rows in B, and the number of columns in the product is the same 
as the number of columns in A. 

(c) The number of columns in the first factor must be the same as the
number of rows in the second factor.

If A and B denote any two matrices and C denotes the product AB,
then

AB = C.

But BA ≠ C, in general.

That is, matrix multiplication is not commutative in general. For 
example, 

but 

an entirely different result. 
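A small numerical check makes the lack of commutativity concrete. The sketch below is an illustration only: the matrices `P` and `Q` are hypothetical values chosen here, not taken from the text, and `mat_mul` is an assumed helper name.

```python
def mat_mul(p, q):
    """Product PQ: rows of P multiplied into columns of Q."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*q)] for row in p]


# Hypothetical matrices, chosen only to show that the order of factors matters.
P = [[1, 2], [3, 4]]
Q = [[0, 1], [1, 0]]

PQ = mat_mul(P, Q)   # postmultiplying P by Q interchanges the columns of P
QP = mat_mul(Q, P)   # premultiplying P by Q interchanges the rows of P
```

Here `PQ` and `QP` contain the same numbers in different positions, so the two products are different matrices.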

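An important exception occurs in the case of the product of any matrix
by a unit matrix. For example,

\[
\begin{bmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{bmatrix}
\begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}
= \begin{bmatrix}
a_1 + 0 + 0 & 0 + a_2 + 0 & 0 + 0 + a_3 \\
b_1 + 0 + 0 & 0 + b_2 + 0 & 0 + 0 + b_3 \\
c_1 + 0 + 0 & 0 + c_2 + 0 & 0 + 0 + c_3
\end{bmatrix}
= \begin{bmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{bmatrix}
\]

and

\[
\begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}
\begin{bmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{bmatrix}
= \begin{bmatrix}
a_1 + 0 + 0 & a_2 + 0 + 0 & a_3 + 0 + 0 \\
0 + b_1 + 0 & 0 + b_2 + 0 & 0 + b_3 + 0 \\
0 + 0 + c_1 & 0 + 0 + c_2 & 0 + 0 + c_3
\end{bmatrix}
= \begin{bmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{bmatrix},
\]

which is the same result in both cases.

It is to be noted that the multiplication of a matrix by a unit matrix
does not change its value. More generally,

\[
AI = AI^2 = \cdots = AI^n = A.
\]

In the product AB = C, B is said to be premultiplied by A; whereas in
the product BA = D, B is said to be postmultiplied by A. Because matrix
multiplication is not commutative in general, premultiplication and
postmultiplication do not give the same result in general.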




The multiplication of one matrix by another is a most important process
and should be thoroughly learned and kept in mind by everybody who works
with matrices. It is used constantly in checking the inversion of matrices.

100. Inversion of Matrices. The operation of dividing one matrix
directly by another does not exist in matrix theory, but the equivalent of
division can be accomplished in most cases by a process called the inversion
of matrices. The inverse of a square matrix A is another square matrix
$A^{-1}$ of the same order, such that

(1)   $A A^{-1} = I$,

where I denotes a unit matrix of the same order. Moreover, it can be
shown that

$A^{-1} A = I$,

so that

$A A^{-1} = A^{-1} A$.

Hence a matrix is commutative with its inverse. The process of finding
$A^{-1}$ when A is given is called the inversion of A. Before showing how to
find $A^{-1}$, we digress for a moment to consider the condition necessary for
its existence.

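If A denotes any square matrix such as

\[
A = \begin{bmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{bmatrix},
\]

the determinant

\[
|A| = \begin{vmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{vmatrix}
\]

is called the determinant of the matrix. If the determinant of a matrix
is not zero, the matrix is said to be non-singular and can always be inverted.
That is, if $|A| \neq 0$, $A^{-1}$ can always be found. The reason for this is
as follows:

It is shown in most books dealing with matrix theory that

\[
(2)\qquad
A^{-1} =
\begin{bmatrix}
\dfrac{A_1}{|A|} & \dfrac{B_1}{|A|} & \dfrac{C_1}{|A|} \\[2ex]
\dfrac{A_2}{|A|} & \dfrac{B_2}{|A|} & \dfrac{C_2}{|A|} \\[2ex]
\dfrac{A_3}{|A|} & \dfrac{B_3}{|A|} & \dfrac{C_3}{|A|}
\end{bmatrix}
= \frac{1}{|A|}
\begin{bmatrix} A_1 & B_1 & C_1 \\ A_2 & B_2 & C_2 \\ A_3 & B_3 & C_3 \end{bmatrix},
\]

where $A_1$, $B_1$, etc., are the cofactors of the elements $a_1$, $b_1$, etc., in the
determinant $|A|$. Evidently $A^{-1}$ could not exist if $|A| = 0$.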
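Formula (2) can be tried numerically with a short sketch. It is an illustration only: the helper names `minor`, `det`, and `inverse_by_cofactors` are hypothetical, determinants are expanded by the first row (adequate for small orders), and exact fractions are used so that no rounding enters. The sample matrix is the one inverted in Example 1 of this article.

```python
from fractions import Fraction


def minor(m, i, j):
    """Matrix m with row i and column j struck out."""
    return [[v for c, v in enumerate(row) if c != j] for r, row in enumerate(m) if r != i]


def det(m):
    """Determinant by expansion along the first row."""
    if not m:
        return 1
    if len(m) == 1:
        return m[0][0]
    return sum((-1) ** j * m[0][j] * det(minor(m, 0, j)) for j in range(len(m)))


def inverse_by_cofactors(m):
    """Inverse from formula (2): element (i, j) is the cofactor of element (j, i) over |A|."""
    d = det(m)
    if d == 0:
        raise ValueError("singular matrix: the determinant is zero")
    n = len(m)
    return [[Fraction((-1) ** (i + j) * det(minor(m, j, i)), d) for j in range(n)]
            for i in range(n)]


A = [[2, -2, 4], [2, 3, 2], [-1, 1, -1]]
A_inv = inverse_by_cofactors(A)
```

Every element of the result requires its own determinant, which is the reason the text calls this formula of little value for numerical work on larger matrices.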

Several methods have been devised for finding the inverse of a matrix.
It can be found, for example, by means of formula (2). Although (2)
is of value theoretically, it is of little value in the inversion of numerical 
matrices, because of the large number of determinants (the cofactors) 
that must be evaluated. The method of inversion explained below is 
simple, direct, and reasonably short. By a procedure similar to the Gauss 
method explained in Art. 96, the given matrix is transformed into its 
inverse by means of a unit matrix of the same order. The transformation 
is made in consecutive steps, the number of steps being equal to the order 
of the matrix. A single column of the unit matrix is used in each step. 
The aim in each step is to reduce to zero all the elements in the first 
column except one, and that element is reduced to unity by dividing its 
row throughout by such a number as will make it unity. The matrix at 
the beginning of each step is augmented by the appropriate column of the 
unit matrix, and all elements in each row of the augmented matrix are 
subjected to the same operations. The underlying theory of this method 
of inverting matrices will not be given. The fact that the method always 
gives the correct result is a sufficient indication of its soundness. The 
simplest case of the method and a more general case will be explained by 
means of examples. The first element in the pivot line of every matrix 
will be in bold type. 

a) The simplest case.

Example 1. Find the inverse of the matrix

\[
A = \begin{bmatrix} 2 & -2 & 4 \\ 2 & 3 & 2 \\ -1 & 1 & -1 \end{bmatrix}.
\]

Solution. The given matrix is first augmented by inserting the first
column of a unit matrix of the third order, as follows:

\[
\left[\begin{array}{rrr|r} 2 & -2 & 4 & 1 \\ 2 & 3 & 2 & 0 \\ -1 & 1 & -1 & 0 \end{array}\right].
\]

The first row is then divided throughout by 2, giving

\[
\left[\begin{array}{rrr|r} 1 & -1 & 2 & 1/2 \\ 2 & 3 & 2 & 0 \\ -1 & 1 & -1 & 0 \end{array}\right].
\]

We now multiply the first row by 2 and subtract the result from the second
row, and we also add the first row to the third row, thereby obtaining

\[
\left[\begin{array}{rrr|r} 1 & -1 & 2 & 1/2 \\ 0 & 5 & -2 & -1 \\ 0 & 0 & 1 & 1/2 \end{array}\right].
\]

This ends the first step of the transformation.

To start on the second step, we discard the first column of the old
matrix and augment the last three columns by the second column of the
unit matrix. Thus,

\[
\left[\begin{array}{rrr|r} -1 & 2 & 1/2 & 0 \\ 5 & -2 & -1 & 1 \\ 0 & 1 & 1/2 & 0 \end{array}\right].
\]

We now divide the second row throughout by 5 and add the result to the
first row, thus obtaining

\[
\left[\begin{array}{rrr|r} 0 & 8/5 & 3/10 & 1/5 \\ 1 & -2/5 & -1/5 & 1/5 \\ 0 & 1 & 1/2 & 0 \end{array}\right].
\]

This ends the second step. Note that nothing was done to the third row,
because its element in the first column was already 0.

We begin the third and last step of the transformation by discarding the
first column of the matrix just found and then augmenting the remaining
columns by the third column of the unit matrix. Then we have

\[
\left[\begin{array}{rrr|r} 8/5 & 3/10 & 1/5 & 0 \\ -2/5 & -1/5 & 1/5 & 0 \\ 1 & 1/2 & 0 & 1 \end{array}\right].
\]

Now multiply the third row by 8/5 and subtract the result from the first
row, and also multiply the third row by 2/5 and add the result to the
second row. This gives

\[
\left[\begin{array}{rrr|r} 0 & -1/2 & 1/5 & -8/5 \\ 0 & 0 & 1/5 & 2/5 \\ 1 & 1/2 & 0 & 1 \end{array}\right].
\]

This ends the third step of the transformation. On dropping the first
column of the matrix just obtained, we have

\[
\begin{bmatrix} -1/2 & 1/5 & -8/5 \\ 0 & 1/5 & 2/5 \\ 1/2 & 0 & 1 \end{bmatrix}
\]

as the inverse of the given matrix.

Since mistakes are easily made in the transformations, we check the
result by seeing whether the inverse premultiplied by the given matrix
gives the unit matrix. We therefore have

\[
\begin{bmatrix} 2 & -2 & 4 \\ 2 & 3 & 2 \\ -1 & 1 & -1 \end{bmatrix}
\times
\begin{bmatrix} -1/2 & 1/5 & -8/5 \\ 0 & 1/5 & 2/5 \\ 1/2 & 0 & 1 \end{bmatrix}
\]

\[
= \begin{bmatrix}
-1 + 0 + 2 & 2/5 - 2/5 + 0 & -16/5 - 4/5 + 4 \\
-1 + 0 + 1 & 2/5 + 3/5 + 0 & -16/5 + 6/5 + 2 \\
1/2 + 0 - 1/2 & -1/5 + 1/5 + 0 & 8/5 + 2/5 - 1
\end{bmatrix}
= \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}.
\]

The inverse found is therefore correct.
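The transformation of Example 1 can be traced with a short program. What follows is a sketch under stated assumptions rather than a definitive implementation: the function name `invert_consecutive` is hypothetical, the pivotal rows are taken in consecutive order exactly as in the simplest case, and exact fractions stand in for hand arithmetic.

```python
from fractions import Fraction


def invert_consecutive(matrix):
    """Invert a square matrix, using rows 1, 2, ..., n in turn as pivotal rows."""
    n = len(matrix)
    work = [[Fraction(v) for v in row] for row in matrix]
    for step in range(n):
        # Augment by the column of the unit matrix that matches the pivotal row.
        for r in range(n):
            work[r].append(Fraction(int(r == step)))
        pivot = work[step][0]
        if pivot == 0:
            raise ZeroDivisionError("zero pivot: another pivotal row must be chosen")
        # Reduce the first element of the pivotal row to unity.
        work[step] = [v / pivot for v in work[step]]
        # Reduce the first element of every other row to zero.
        for r in range(n):
            if r != step:
                factor = work[r][0]
                work[r] = [v - factor * p for v, p in zip(work[r], work[step])]
        # Discard the first column, which is now a column of the unit matrix.
        work = [row[1:] for row in work]
    return work


inverse = invert_consecutive([[2, -2, 4], [2, 3, 2], [-1, 1, -1]])
```

Each pass through the loop corresponds to one "step" of the text, and the matrix left after the last pass is the inverse itself because no rows were taken out of order.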

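Example 2. Find the inverse of the matrix

\[
\begin{bmatrix} 2 & -2 & 0 & -1 \\ 0 & 2 & 1 & 2 \\ 1 & -2 & 3 & -2 \\ 0 & 1 & 2 & 2 \end{bmatrix}.
\]

Solution. The steps in the solution are shown below.

\[
\left[\begin{array}{rrrr|r}
2 & -2 & 0 & -1 & 1 \\ 0 & 2 & 1 & 2 & 0 \\ 1 & -2 & 3 & -2 & 0 \\ 0 & 1 & 2 & 2 & 0
\end{array}\right]
\rightarrow
\left[\begin{array}{rrrr|r}
1 & -1 & 0 & -1/2 & 1/2 \\ 0 & 2 & 1 & 2 & 0 \\ 1 & -2 & 3 & -2 & 0 \\ 0 & 1 & 2 & 2 & 0
\end{array}\right]
\]

Subtract row 1 from row 3. Then we have

\[
\left[\begin{array}{rrrr|r}
1 & -1 & 0 & -1/2 & 1/2 \\ 0 & 2 & 1 & 2 & 0 \\ 0 & -1 & 3 & -3/2 & -1/2 \\ 0 & 1 & 2 & 2 & 0
\end{array}\right]
\qquad\text{End of step 1}
\]

\[
\left[\begin{array}{rrrr|r}
-1 & 0 & -1/2 & 1/2 & 0 \\ 2 & 1 & 2 & 0 & 1 \\ -1 & 3 & -3/2 & -1/2 & 0 \\ 1 & 2 & 2 & 0 & 0
\end{array}\right]
\rightarrow
\left[\begin{array}{rrrr|r}
-1 & 0 & -1/2 & 1/2 & 0 \\ 1 & 1/2 & 1 & 0 & 1/2 \\ -1 & 3 & -3/2 & -1/2 & 0 \\ 1 & 2 & 2 & 0 & 0
\end{array}\right]
\]

Add row 2 to rows 1 and 3 and subtract it from row 4, obtaining

\[
\left[\begin{array}{rrrr|r}
0 & 1/2 & 1/2 & 1/2 & 1/2 \\ 1 & 1/2 & 1 & 0 & 1/2 \\ 0 & 7/2 & -1/2 & -1/2 & 1/2 \\ 0 & 3/2 & 1 & 0 & -1/2
\end{array}\right]
\qquad\text{End of step 2}
\]

\[
\left[\begin{array}{rrrr|r}
1/2 & 1/2 & 1/2 & 1/2 & 0 \\ 1/2 & 1 & 0 & 1/2 & 0 \\ 7/2 & -1/2 & -1/2 & 1/2 & 1 \\ 3/2 & 1 & 0 & -1/2 & 0
\end{array}\right]
\rightarrow
\left[\begin{array}{rrrr|r}
1/2 & 1/2 & 1/2 & 1/2 & 0 \\ 1/2 & 1 & 0 & 1/2 & 0 \\ 1/2 & -1/14 & -1/14 & 1/14 & 1/7 \\ 3/2 & 1 & 0 & -1/2 & 0
\end{array}\right]
\]

Subtract row 3 from rows 1 and 2, and subtract three times row 3 from
row 4. The result is

\[
\left[\begin{array}{rrrr|r}
0 & 4/7 & 4/7 & 3/7 & -1/7 \\ 0 & 15/14 & 1/14 & 3/7 & -1/7 \\ 1/2 & -1/14 & -1/14 & 1/14 & 1/7 \\ 0 & 17/14 & 3/14 & -5/7 & -3/7
\end{array}\right]
\rightarrow
\left[\begin{array}{rrrr|r}
0 & 4/7 & 4/7 & 3/7 & -1/7 \\ 0 & 15/14 & 1/14 & 3/7 & -1/7 \\ 1 & -1/7 & -1/7 & 1/7 & 2/7 \\ 0 & 17/14 & 3/14 & -5/7 & -3/7
\end{array}\right]
\qquad\text{End of step 3}
\]

\[
\left[\begin{array}{rrrr|r}
4/7 & 4/7 & 3/7 & -1/7 & 0 \\ 15/14 & 1/14 & 3/7 & -1/7 & 0 \\ -1/7 & -1/7 & 1/7 & 2/7 & 0 \\ 17/14 & 3/14 & -5/7 & -3/7 & 1
\end{array}\right]
\rightarrow
\left[\begin{array}{rrrr|r}
4/7 & 4/7 & 3/7 & -1/7 & 0 \\ 15/14 & 1/14 & 3/7 & -1/7 & 0 \\ -1/7 & -1/7 & 1/7 & 2/7 & 0 \\ 1 & 3/17 & -10/17 & -6/17 & 14/17
\end{array}\right]
\]

Multiply row 4 by 4/7 and subtract the result from row 1, multiply row 4
by 15/14 and subtract the result from row 2, and multiply row 4 by 1/7
and add the result to row 3. The result is

\[
\left[\begin{array}{rrrr|r}
0 & 8/17 & 13/17 & 1/17 & -8/17 \\ 0 & -2/17 & 18/17 & 4/17 & -15/17 \\ 0 & -2/17 & 1/17 & 4/17 & 2/17 \\ 1 & 3/17 & -10/17 & -6/17 & 14/17
\end{array}\right]
\qquad\text{End of step 4.}
\]

Hence

\[
\begin{bmatrix}
8/17 & 13/17 & 1/17 & -8/17 \\ -2/17 & 18/17 & 4/17 & -15/17 \\ -2/17 & 1/17 & 4/17 & 2/17 \\ 3/17 & -10/17 & -6/17 & 14/17
\end{bmatrix}
= \frac{1}{17}
\begin{bmatrix}
8 & 13 & 1 & -8 \\ -2 & 18 & 4 & -15 \\ -2 & 1 & 4 & 2 \\ 3 & -10 & -6 & 14
\end{bmatrix}
\]

is the desired inverse matrix. Premultiplication of this by the given
matrix will give a unit matrix of the fourth order, as the reader may verify.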


Example 3. Invert the matrix

\[
\begin{bmatrix} 1.254 & 0.831 & 1.109 \\ 0.532 & 1.105 & 0.702 \\ 0.957 & 1.342 & 0.642 \end{bmatrix}.
\]

Solution. We have

\[
\left[\begin{array}{rrr|r}
1.254 & 0.831 & 1.109 & 1 \\ 0.532 & 1.105 & 0.702 & 0 \\ 0.957 & 1.342 & 0.642 & 0
\end{array}\right]
\rightarrow
\left[\begin{array}{rrr|r}
1 & 0.6627 & 0.8844 & 0.7974 \\ 0.532 & 1.105 & 0.702 & 0 \\ 0.957 & 1.342 & 0.642 & 0
\end{array}\right],
\]

where we have divided row 1 by 1.254. Now multiply row 1 of the matrix
on the right by 0.532 and subtract the result from row 2, and also multiply
row 1 by 0.957 and subtract the result from row 3. We thus get

\[
\left[\begin{array}{rrr|r}
1 & 0.6627 & 0.8844 & 0.7974 \\ 0 & 0.7524 & 0.2315 & -0.4242 \\ 0 & 0.7078 & -0.2044 & -0.7631
\end{array}\right]
\qquad\text{End of step 1}
\]

\[
\left[\begin{array}{rrr|r}
0.6627 & 0.8844 & 0.7974 & 0 \\ 0.7524 & 0.2315 & -0.4242 & 1 \\ 0.7078 & -0.2044 & -0.7631 & 0
\end{array}\right]
\rightarrow
\left[\begin{array}{rrr|r}
0.6627 & 0.8844 & 0.7974 & 0 \\ 1 & 0.3077 & -0.5638 & 1.3291 \\ 0.7078 & -0.2044 & -0.7631 & 0
\end{array}\right]
\]

Now multiply row 2 by 0.6627 and subtract the result from row 1. Also
multiply row 2 by 0.7078 and subtract the result from row 3. Then we have

\[
\left[\begin{array}{rrr|r}
0 & 0.6805 & 1.1710 & -0.8808 \\ 1 & 0.3077 & -0.5638 & 1.3291 \\ 0 & -0.4217 & -0.3640 & -0.9407
\end{array}\right]
\qquad\text{End of step 2}
\]

\[
\left[\begin{array}{rrr|r}
0.6805 & 1.1710 & -0.8808 & 0 \\ 0.3077 & -0.5638 & 1.3291 & 0 \\ -0.4217 & -0.3640 & -0.9407 & 1
\end{array}\right]
\rightarrow
\left[\begin{array}{rrr|r}
0.6805 & 1.1710 & -0.8808 & 0 \\ 0.3077 & -0.5638 & 1.3291 & 0 \\ 1 & 0.8632 & 2.2308 & -2.3714
\end{array}\right]
\]

On multiplying row 3 by 0.6805 and 0.3077 and subtracting the respective
results from rows 1 and 2, we get

\[
\left[\begin{array}{rrr|r}
0 & 0.5836 & -2.3988 & 1.614 \\ 0 & -0.8294 & 0.6427 & 0.7297 \\ 1 & 0.8632 & 2.2308 & -2.3714
\end{array}\right]
\qquad\text{End of step 3.}
\]

The required inverse is therefore

\[
\begin{bmatrix} 0.5836 & -2.3988 & 1.614 \\ -0.8294 & 0.6427 & 0.7297 \\ 0.8632 & 2.2308 & -2.3714 \end{bmatrix}.
\]

When this is premultiplied by the given matrix, the result is

\[
\begin{bmatrix} 0.9997 & 0.0000 & 0.0008 \\ -0.0001 & 1.0001 & 0.0002 \\ -0.0005 & -0.0009 & 1.0014 \end{bmatrix},
\]

which is practically a unit matrix.

b) Inversion in the more general case. In the preceding examples the
pivot lines have been taken in consecutive order, always beginning with
the first line of the given matrix. This procedure cannot be followed if
the first element in the pivotal line is zero. Furthermore, if the first
element in the pivotal line is not zero, it is sometimes desirable to take
pivotal lines in any order. However, when the pivotal lines are not taken
in consecutive order, the matrix obtained at the end of the last step will
not be the desired inverse, but will be that inverse with its rows and
columns permuted. The matrix obtained in the last step must therefore
be unscrambled to obtain the desired inverse. The unscrambling process
will be explained by means of an example.



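Example 4. Find the inverse of the matrix

\[
\begin{bmatrix} 1 & -2 & 3 & 4 \\ 3 & -1 & 2 & 5 \\ 2 & 4 & -5 & 1 \\ 4 & 2 & -1 & 3 \end{bmatrix}.
\]

Solution. Here we take as pivotal row the one having the largest element
in the first column: the fourth row. Augmenting the given matrix by the
fourth column of a unit matrix of the fourth order, we have

\[
\left[\begin{array}{rrrr|r}
1 & -2 & 3 & 4 & 0 \\ 3 & -1 & 2 & 5 & 0 \\ 2 & 4 & -5 & 1 & 0 \\ 4 & 2 & -1 & 3 & 1
\end{array}\right]
\rightarrow
\left[\begin{array}{rrrr|r}
1 & -2 & 3 & 4 & 0 \\ 3 & -1 & 2 & 5 & 0 \\ 2 & 4 & -5 & 1 & 0 \\ 1 & 1/2 & -1/4 & 3/4 & 1/4
\end{array}\right].
\]

Subtracting row 4 from row 1, three times row 4 from row 2, and twice
row 4 from row 3, we get

\[
\left[\begin{array}{rrrr|r}
0 & -5/2 & 13/4 & 13/4 & -1/4 \\ 0 & -5/2 & 11/4 & 11/4 & -3/4 \\ 0 & 3 & -9/2 & -1/2 & -1/2 \\ 1 & 1/2 & -1/4 & 3/4 & 1/4
\end{array}\right]
\qquad\text{End of step 1}
\]

\[
\left[\begin{array}{rrrr|r}
-5/2 & 13/4 & 13/4 & -1/4 & 0 \\ -5/2 & 11/4 & 11/4 & -3/4 & 0 \\ 3 & -9/2 & -1/2 & -1/2 & 1 \\ 1/2 & -1/4 & 3/4 & 1/4 & 0
\end{array}\right]
\rightarrow
\left[\begin{array}{rrrr|r}
-5/2 & 13/4 & 13/4 & -1/4 & 0 \\ -5/2 & 11/4 & 11/4 & -3/4 & 0 \\ 1 & -3/2 & -1/6 & -1/6 & 1/3 \\ 1/2 & -1/4 & 3/4 & 1/4 & 0
\end{array}\right]
\]

Now adding 5/2 times row 3 to rows 1 and 2, and subtracting 1/2 of
row 3 from row 4, we get

\[
\left[\begin{array}{rrrr|r}
0 & -1/2 & 17/6 & -2/3 & 5/6 \\ 0 & -1 & 7/3 & -7/6 & 5/6 \\ 1 & -3/2 & -1/6 & -1/6 & 1/3 \\ 0 & 1/2 & 5/6 & 1/3 & -1/6
\end{array}\right]
\qquad\text{End of step 2}
\]

\[
\left[\begin{array}{rrrr|r}
-1/2 & 17/6 & -2/3 & 5/6 & 0 \\ -1 & 7/3 & -7/6 & 5/6 & 1 \\ -3/2 & -1/6 & -1/6 & 1/3 & 0 \\ 1/2 & 5/6 & 1/3 & -1/6 & 0
\end{array}\right]
\rightarrow
\left[\begin{array}{rrrr|r}
-1/2 & 17/6 & -2/3 & 5/6 & 0 \\ 1 & -7/3 & 7/6 & -5/6 & -1 \\ -3/2 & -1/6 & -1/6 & 1/3 & 0 \\ 1/2 & 5/6 & 1/3 & -1/6 & 0
\end{array}\right]
\]

Adding 1/2 row 2 to row 1, 3/2 row 2 to row 3, and subtracting 1/2
row 2 from row 4, we get

\[
\left[\begin{array}{rrrr|r}
0 & 5/3 & -1/12 & 5/12 & -1/2 \\ 1 & -7/3 & 7/6 & -5/6 & -1 \\ 0 & -11/3 & 19/12 & -11/12 & -3/2 \\ 0 & 2 & -1/4 & 1/4 & 1/2
\end{array}\right]
\qquad\text{End of step 3}
\]

\[
\left[\begin{array}{rrrr|r}
5/3 & -1/12 & 5/12 & -1/2 & 1 \\ -7/3 & 7/6 & -5/6 & -1 & 0 \\ -11/3 & 19/12 & -11/12 & -3/2 & 0 \\ 2 & -1/4 & 1/4 & 1/2 & 0
\end{array}\right]
\rightarrow
\left[\begin{array}{rrrr|r}
1 & -1/20 & 1/4 & -3/10 & 3/5 \\ -7/3 & 7/6 & -5/6 & -1 & 0 \\ -11/3 & 19/12 & -11/12 & -3/2 & 0 \\ 2 & -1/4 & 1/4 & 1/2 & 0
\end{array}\right]
\]

Now add 7/3 of row 1 to row 2 and 11/3 of row 1 to row 3. Also subtract
twice row 1 from row 4. Then we have

\[
\left[\begin{array}{rrrr|r}
1 & -1/20 & 1/4 & -3/10 & 3/5 \\ 0 & 21/20 & -1/4 & -17/10 & 7/5 \\ 0 & 7/5 & 0 & -13/5 & 11/5 \\ 0 & -3/20 & -1/4 & 11/10 & -6/5
\end{array}\right]
\qquad\text{End of step 4.}
\]

Hence the permuted inverse is

\[
(A)\qquad
\begin{bmatrix}
-1/20 & 1/4 & -3/10 & 3/5 \\ 21/20 & -1/4 & -17/10 & 7/5 \\ 7/5 & 0 & -13/5 & 11/5 \\ -3/20 & -1/4 & 11/10 & -6/5
\end{bmatrix}.
\]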



To unscramble this matrix, we first permute its rows and then permute
the columns of the resultant matrix. In order to make the permutations
in a systematic and infallible manner, we construct a table showing the
pivotal row used in each step of the transformation. The table for the
present example is:

Steps:        1  2  3  4   (for rows)

Pivot rows:   4  3  2  1   (for columns).

The use of this table is as follows: To find the rows in the first
permuted matrix, we fix our attention on the numbers in the top row of the
table and note the number directly under any particular number. For
example, the number directly under 2 in the top row is 3, and this means 
that row 2 of the new matrix will be row 3 of the previous matrix (the 
scrambled inverse found in the last step of the transformation). 

To find the columns in the final inverse matrix, we fix our attention 
on the numbers in the bottom row of the table and note the number directly 
over any particular number. For example, the number directly over 1 in 
the bottom row is 4, and this means that column 1 in the final matrix is 
column 4 of the previous matrix (the one having the unscrambled rows). 
We now proceed to unscramble (A), first unscrambling the rows. 

To find row 1 of the next matrix after (A), we look for 1 in the top 
row of the table and find that the number immediately under it is 4. 
Hence row 1 of the new matrix is row 4 of (A). Likewise, under 2 of the 
top row we find 3 and therefore row 2 of the new matrix is row 3 of (A). 
The remaining rows of the new matrix are found in the same manner 
and we therefore get the matrix 

\[
(B)\qquad
\begin{bmatrix}
-3/20 & -1/4 & 11/10 & -6/5 \\ 7/5 & 0 & -13/5 & 11/5 \\ 21/20 & -1/4 & -17/10 & 7/5 \\ -1/20 & 1/4 & -3/10 & 3/5
\end{bmatrix}.
\]

We now unscramble the columns by applying the table to (B). To find
column 1 of the final inverse matrix, we look for 1 in the bottom row of
the table and find 4 immediately above it. Column 1 of the inverse is
therefore column 4 of (B). To find column 2 of the inverse, we look for
2 in the bottom row of the table and find 3 immediately above it. Column
2 of the inverse is therefore column 3 of (B). The remaining columns
are found in the same manner and we thus get

\[
(C)\qquad
\begin{bmatrix}
-6/5 & 11/10 & -1/4 & -3/20 \\ 11/5 & -13/5 & 0 & 7/5 \\ 7/5 & -17/10 & -1/4 & 21/20 \\ 3/5 & -3/10 & 1/4 & -1/20
\end{bmatrix}
\]

as the desired inverse matrix. The reader will find that if (C) be
multiplied by the original matrix the result will be a unit matrix of the fourth
order. That shows that (C) is the correct inverse.

The fact that the numbers of the pivotal rows were the inverse of the 
numbers of the steps in the above example was more or less accidental. 
The third row was used as pivotal row in the second step because its first 
element, 3, was the largest of the first elements in the remaining unused 
rows. In the third step there was little choice between the first and second 
rows. So we used the second row as pivot. 

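In further explanation of the unscrambling process, we consider the
simple matrix

\[
\begin{bmatrix} 1 & -2 & 3 \\ 3 & -1 & 4 \\ 2 & 1 & -2 \end{bmatrix}.
\]

Carrying out the inversion transformations, we have

\[
\left[\begin{array}{rrr|r} 1 & -2 & 3 & 0 \\ 3 & -1 & 4 & 1 \\ 2 & 1 & -2 & 0 \end{array}\right]
\rightarrow
\left[\begin{array}{rrr|r} 1 & -2 & 3 & 0 \\ 1 & -1/3 & 4/3 & 1/3 \\ 2 & 1 & -2 & 0 \end{array}\right]
\rightarrow
\left[\begin{array}{rrr|r} 0 & -5/3 & 5/3 & -1/3 \\ 1 & -1/3 & 4/3 & 1/3 \\ 0 & 5/3 & -14/3 & -2/3 \end{array}\right]
\qquad\text{End of step 1}
\]

\[
\left[\begin{array}{rrr|r} -5/3 & 5/3 & -1/3 & 0 \\ -1/3 & 4/3 & 1/3 & 0 \\ 5/3 & -14/3 & -2/3 & 1 \end{array}\right]
\rightarrow
\left[\begin{array}{rrr|r} 0 & -3 & -1 & 1 \\ 0 & 2/5 & 1/5 & 1/5 \\ 5/3 & -14/3 & -2/3 & 1 \end{array}\right]
\rightarrow
\left[\begin{array}{rrr|r} 0 & -3 & -1 & 1 \\ 0 & 2/5 & 1/5 & 1/5 \\ 1 & -14/5 & -2/5 & 3/5 \end{array}\right]
\qquad\text{End of step 2}
\]

\[
\left[\begin{array}{rrr|r} -3 & -1 & 1 & 1 \\ 2/5 & 1/5 & 1/5 & 0 \\ -14/5 & -2/5 & 3/5 & 0 \end{array}\right]
\rightarrow
\left[\begin{array}{rrr|r} 1 & 1/3 & -1/3 & -1/3 \\ 2/5 & 1/5 & 1/5 & 0 \\ -14/5 & -2/5 & 3/5 & 0 \end{array}\right]
\rightarrow
\left[\begin{array}{rrr|r} 1 & 1/3 & -1/3 & -1/3 \\ 0 & 1/15 & 1/3 & 2/15 \\ 0 & 8/15 & -1/3 & -14/15 \end{array}\right]
\qquad\text{End of step 3}
\]

\[
(A)\qquad
\begin{bmatrix} 1/3 & -1/3 & -1/3 \\ 1/15 & 1/3 & 2/15 \\ 8/15 & -1/3 & -14/15 \end{bmatrix}
\qquad\text{Permuted inverse.}
\]

The table showing the pivotal rows for the three steps is:

Steps:        1  2  3   (for rows)

Pivot rows:   2  3  1   (for columns).

The table shows that row 1 of the next matrix is row 2 of (A), that row
2 is row 3 of (A), and that row 3 is row 1 of (A). Hence we have

\[
(B)\qquad
\begin{bmatrix} 1/15 & 1/3 & 2/15 \\ 8/15 & -1/3 & -14/15 \\ 1/3 & -1/3 & -1/3 \end{bmatrix}.
\]

Now looking at the bottom row of the table, we see that column 1 of
the inverse is column 3 of (B), that column 2 is column 1 of (B), and
that column 3 is column 2 of (B). Hence we write

\[
(C)\qquad
\begin{bmatrix} 2/15 & 1/15 & 1/3 \\ -14/15 & 8/15 & -1/3 \\ -1/3 & 1/3 & -1/3 \end{bmatrix}
\qquad\text{Inverse of given matrix.}
\]

If we multiply (C) by the given matrix, we get

\[
\begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix},
\]

which shows that (C) is the desired inverse.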


The reader will note that in any step of the transformation the number 
of the augmenting column from the unit matrix must agree with the 
number of the row used as pivot. For example, if the third row of a 
matrix is the pivotal row, the matrix must be augmented by the third 
column of the unit matrix. 

The reader should also bear in mind that any row can be used as pivot 
only once in the transformation. In other words, a different row must 
be used as pivot for each step. 
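The general case, including the unscrambling table, can be sketched in code. The following is a hedged illustration, not the author's own schedule: the function name `invert_with_pivoting` is hypothetical, the pivotal row in each step is assumed to be the unused row whose first element is largest in absolute value (the choice made in Example 4), and ties are broken by taking the lowest-numbered row, which is an assumption of this sketch.

```python
from fractions import Fraction


def invert_with_pivoting(matrix):
    """Invert a square matrix, choosing pivotal rows freely and then unscrambling."""
    n = len(matrix)
    work = [[Fraction(v) for v in row] for row in matrix]
    unused = set(range(n))
    pivots = []                          # pivots[s] = row used as pivot in step s
    for _ in range(n):
        p = max(sorted(unused), key=lambda r: abs(work[r][0]))
        if work[p][0] == 0:
            raise ValueError("singular matrix: no usable pivotal row")
        unused.remove(p)                 # a row may serve as pivot only once
        pivots.append(p)
        # The augmenting column of the unit matrix must match the pivotal row.
        for r in range(n):
            work[r].append(Fraction(int(r == p)))
        lead = work[p][0]
        work[p] = [v / lead for v in work[p]]
        for r in range(n):
            if r != p:
                factor = work[r][0]
                work[r] = [v - factor * q for v, q in zip(work[r], work[p])]
        work = [row[1:] for row in work]
    # Rows: row s of the new matrix is the row of the scrambled matrix used as pivot in step s.
    rows = [work[p] for p in pivots]
    # Columns: column j of the inverse is the column numbered by the step in which row j was pivot.
    step_of = {p: s for s, p in enumerate(pivots)}
    return [[row[step_of[j]] for j in range(n)] for row in rows]


inverse = invert_with_pivoting([[1, -2, 3, 4], [3, -1, 2, 5], [2, 4, -5, 1], [4, 2, -1, 3]])
```

For the matrix of Example 4 this choice of pivots reproduces the order 4, 3, 2, 1 of the table. For other matrices the order of pivots may differ from a hand computation, but the unscrambled result is the same inverse.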



101. Solution of Equations by Matrix Methods. In matrix notation
any system of simultaneous linear equations can be represented by the
simple equation

(1)   Ax = k,

where A denotes the matrix of the coefficients, x denotes a column matrix
of the unknown x's, and k denotes a column matrix of the known terms.
On premultiplying (1) by $A^{-1}$, we get

(2)   $x = A^{-1}k$.

Equation (2) gives the solution of the given system. From this it is
seen that in the solution of a system of linear equations by the matrix
method, the chief problem is the inversion of the matrix of the coefficients.
After the inverse matrix has been found, all the unknowns can be found
in one short step. A few examples will show how this is done.

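Example 1. Solve the system of equations

\[
\begin{cases}
2x_1 - 2x_2 + 4x_3 = -12 \\
2x_1 + 3x_2 + 2x_3 = 8 \\
-x_1 + x_2 - x_3 = 7/2.
\end{cases}
\]

Solution. In matrix form this system is written

\[
\begin{bmatrix} 2 & -2 & 4 \\ 2 & 3 & 2 \\ -1 & 1 & -1 \end{bmatrix}
\begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix}
= \begin{bmatrix} -12 \\ 8 \\ 7/2 \end{bmatrix}.
\]

The matrix of the coefficients is the same as Example 1, Art. 100, wherein
the inverse matrix was found to be

\[
\begin{bmatrix} -1/2 & 1/5 & -8/5 \\ 0 & 1/5 & 2/5 \\ 1/2 & 0 & 1 \end{bmatrix}.
\]

Hence by (2) we have

\[
\begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix}
= \begin{bmatrix} -1/2 & 1/5 & -8/5 \\ 0 & 1/5 & 2/5 \\ 1/2 & 0 & 1 \end{bmatrix}
\times \begin{bmatrix} -12 \\ 8 \\ 7/2 \end{bmatrix}.
\]

On performing the indicated multiplication in the right member, we get

\[
\begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix}
= \begin{bmatrix} 6 + 8/5 - 28/5 \\ 0 + 8/5 + 7/5 \\ -6 + 0 + 7/2 \end{bmatrix}
= \begin{bmatrix} 2 \\ 3 \\ -5/2 \end{bmatrix}.
\]

Hence

$x_1 = 2$, $x_2 = 3$, $x_3 = -5/2$.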

Example 2. Solve the system

\[
\begin{cases}
x - 2y + 3z + 4t = 9/2 \\
3x - y + 2z + 5t = 19/2 \\
2x + 4y - 5z + t = 15 \\
4x + 2y - z + 3t = 12.
\end{cases}
\]

Solution. Here we have

\[
\begin{bmatrix} 1 & -2 & 3 & 4 \\ 3 & -1 & 2 & 5 \\ 2 & 4 & -5 & 1 \\ 4 & 2 & -1 & 3 \end{bmatrix}
\begin{bmatrix} x \\ y \\ z \\ t \end{bmatrix}
= \begin{bmatrix} 9/2 \\ 19/2 \\ 15 \\ 12 \end{bmatrix}.
\]

The matrix of the coefficients is seen to be the same as the matrix of
Example 4, Art. 100, whose inverse was found to be (C). Then by (2)
we have

\[
\begin{bmatrix} x \\ y \\ z \\ t \end{bmatrix}
= \begin{bmatrix}
-6/5 & 11/10 & -1/4 & -3/20 \\ 11/5 & -13/5 & 0 & 7/5 \\ 7/5 & -17/10 & -1/4 & 21/20 \\ 3/5 & -3/10 & 1/4 & -1/20
\end{bmatrix}
\times \begin{bmatrix} 9/2 \\ 19/2 \\ 15 \\ 12 \end{bmatrix}
= \begin{bmatrix} -1/2 \\ 2 \\ -1 \\ 3 \end{bmatrix}.
\]

Hence

x = -1/2, y = 2, z = -1, t = 3.

It will be seen from the above examples that the solution of a system
of non-homogeneous linear equations by the matrix method consists of two
distinct operations: (1) inverting the matrix of the coefficients and (2)
premultiplying the column matrix of the known quantities by the inverted
matrix.
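The second of these two operations is short enough to sketch directly. The code below is an illustration only: `mat_vec` is a hypothetical helper name, and the inverse is simply typed in from Example 1 of this article rather than computed.

```python
from fractions import Fraction


def mat_vec(m, v):
    """Premultiply the column matrix v by the matrix m."""
    return [sum(a * x for a, x in zip(row, v)) for row in m]


# Inverse of the coefficient matrix of Example 1, as found in Art. 100.
A_inv = [[Fraction(-1, 2), Fraction(1, 5), Fraction(-8, 5)],
         [Fraction(0), Fraction(1, 5), Fraction(2, 5)],
         [Fraction(1, 2), Fraction(0), Fraction(1)]]
k = [Fraction(-12), Fraction(8), Fraction(7, 2)]

x = mat_vec(A_inv, k)                     # equation (2): x = A^{-1} k

# Check by substituting back into the original equations, A x = k.
A = [[2, -2, 4], [2, 3, 2], [-1, 1, -1]]
residual_free = mat_vec(A, x) == k
```

Once the inverse is known, the same `A_inv` can be reused for any other column of known terms without repeating the inversion.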


IV. SOLUTION BY ITERATION

102. Systems Solvable by Iteration. All the preceding methods of
solving systems of linear equations involve many subtractions of terms of
the same order of magnitude. When such terms are nearly equal, their
difference is nearly zero. The inaccuracies due to this inherent weakness
of the methods cannot be entirely avoided. The best the computer can do
is to treat all given quantities as exact numbers, use as many significant
figures as practicable throughout the computation, and do as little rounding
as possible until the final results are reached. The results will then be no
more accurate than the given quantities, and may be much less accurate.

The method of iteration explained in Art. 81 is free from the inherent
inaccuracy of the preceding methods. Moreover, it is a self-correcting
method; any errors made at any step in the computation are corrected in
the subsequent iterations.

Unfortunately, however, the method of iteration is not applicable to all
systems of equations. In order for iteration to succeed, each equation of
the system must contain one large coefficient (much larger than the others
in that equation), and the large coefficient must be attached to a different
unknown in each equation. This requirement is met when the large
coefficients are along the leading diagonal of the matrix of the coefficients,
as is sometimes the case. In solving a system of equations by iteration,
each equation is first solved for the unknown having the large coefficient,
thereby expressing it explicitly in terms of the other unknowns. Further
steps in the process are best explained by examples.
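The requirement on the coefficients can be screened mechanically before any iteration is attempted. The sketch below rests on an assumption that goes beyond the text: "much larger than the others" is read as strict diagonal dominance, meaning that each diagonal coefficient exceeds in absolute value the sum of the absolute values of the other coefficients in its equation. This is a commonly used sufficient test, not a threshold stated by the author, and the name `is_diagonally_dominant` is hypothetical.

```python
def is_diagonally_dominant(a):
    """True if every diagonal coefficient outweighs the rest of its row combined."""
    return all(
        abs(row[i]) > sum(abs(v) for j, v in enumerate(row) if j != i)
        for i, row in enumerate(a)
    )


# Coefficients of the first system solved by iteration in this article.
suitable = is_diagonally_dominant([[27, 6, -1], [6, 15, 2], [1, 1, 54]])

# Coefficients of a system treated earlier by matrix inversion.
unsuitable = is_diagonally_dominant([[1, -2, 3], [3, -1, 4], [2, 1, -2]])
```

A system that fails this test may still converge, and the equations can sometimes be reordered so that the large coefficients fall on the leading diagonal; the check is only a quick first screen.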

Example 1. Solve the following equations by iteration:

\[
(1)\qquad
\begin{cases}
27x + 6y - z = 85 \\
6x + 15y + 2z = 72 \\
x + y + 54z = 110.
\end{cases}
\]

Solution. Since these equations meet the requirement for iteration, we
solve each equation for the unknown having the large coefficient and thus
get the system

\[
(2)\qquad
\begin{cases}
x = \frac{1}{27}(85 - 6y + z) & \text{(a)} \\
y = \frac{1}{15}(72 - 6x - 2z) & \text{(b)} \\
z = \frac{1}{54}(110 - x - y) & \text{(c).}
\end{cases}
\]

We start the iteration by putting y = 0, z = 0 in (2) (a), thus getting

\[
x^{(1)} = \frac{85}{27} = 3.15.
\]

Now substituting x = 3.15, z = 0 in (2) (b), we get

\[
y^{(1)} = \frac{1}{15}(72 - 18.90) = 3.54.
\]

Then putting x = 3.15, y = 3.54 in (2) (c), we get

\[
z^{(1)} = \frac{1}{54}(110 - 3.15 - 3.54) = 1.91.
\]

For the second iteration we have

\[
x^{(2)} = \frac{1}{27}(85 - 21.24 + 1.91) = 2.43
\]

\[
y^{(2)} = \frac{1}{15}(72 - 14.58 - 3.82) = 3.57
\]

\[
z^{(2)} = \frac{1}{54}(110 - 2.43 - 3.57) = 1.926.
\]

By continuing in this manner and denoting the successive iterations by
I1, I2, etc., we get the following table.

         x        y        z

I1     3.15     3.54     1.91
I2     2.43     3.57     1.926
I3     2.423    3.574    1.926
I4     2.425    3.573    1.926
I5     2.425    3.573    1.926

The solution of the system (1) is therefore

x = 2.425, y = 3.573, z = 1.926.
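The iteration of Example 1 can be reproduced with a few lines of code. This is a sketch under stated assumptions: the function name `iterate` and the fixed number of sweeps are choices made here for illustration, and full floating-point precision is carried instead of the three or four figures of the hand computation. Each new value is used as soon as it is found, an ordering commonly called the Gauss-Seidel method.

```python
def iterate(a, b, sweeps=25):
    """Solve a x = b by repeated substitution, starting from all zeros."""
    n = len(b)
    x = [0.0] * n
    for _ in range(sweeps):
        for i in range(n):
            # Solve equation i for its own unknown, using the latest values of the others.
            others = sum(a[i][j] * x[j] for j in range(n) if j != i)
            x[i] = (b[i] - others) / a[i][i]
    return x


coefficients = [[27, 6, -1], [6, 15, 2], [1, 1, 54]]
constants = [85, 72, 110]
x, y, z = iterate(coefficients, constants)
```

Because no intermediate rounding is done, the early sweeps differ slightly in the last figure from the tabulated hand values, but the iterations settle on the same solution.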

Example 2. Solve the following system by iteration: 

' 3.122* + 0.5756y — 0.1565s — 0.00671 = 1.571 

0.5756* + 2.938y + 0.1103s — 0.00151 = -0.9275 
' — 0.1565* + 0.1103y + 4.127s + 0.20511 = —0.0652 
— 0.0067* — 0.0015y + 0.2051s + 4.1331 = — 0.0178. 


These equations meet the requirement for iteration. Solving each for 
the unknown having the largest coefficient, we have 


( 4 ) 


* = ^4^ (1.571 —0.5756y + 0.1565s + 0.00671) (a) 

y = ^4^ (—0.9275 — 0.5756* — 0.1103s + 0.00151) (b) 

s = —/5- (-0.0652 + 0.1565*— 0.1 103y — 0.20511) (c) 

t = (—0.0178 + 0.0067* + 0.001 5y—0.20Sls> (d) 


20 



300 


SOLUTION OF SIMULTANEOUS LINEAR EQUATIONS [Chap. XII 


Putting $y = 0$, $z = 0$, $t = 0$ in (a), we get

$$x^{(1)} = \frac{1.571}{3.122} = 0.503.$$

Then putting $x = 0.503$, $z = 0$, $t = 0$ in (b), we get

$$y^{(1)} = \tfrac{1}{2.938}(-0.9275 - 0.5756 \times 0.503) = -0.414.$$

To find $z^{(1)}$ we put $x = 0.503$, $y = -0.414$, $t = 0$ in (c) and get

$$z^{(1)} = \tfrac{1}{4.127}(-0.0652 + 0.1565 \times 0.503 + 0.1103 \times 0.414) = 0.0143.$$

Then for $t^{(1)}$ we have

$$t^{(1)} = \tfrac{1}{4.133}(-0.0178 + 0.0067 \times 0.503 - 0.0015 \times 0.414 - 0.2051 \times 0.0143) = -0.00435.$$

To start the second iteration, we substitute in (4) (a) the above values
of $y^{(1)}$, $z^{(1)}$, and $t^{(1)}$ and get

$$x^{(2)} = 0.580.$$

Then

$$y^{(2)} = \tfrac{1}{2.938}\left[-0.9275 - (0.5756)(0.580) - (0.1103)(0.0143) - (0.0015)(0.00435)\right] = -0.430.$$

The reader will note that as soon as a new value is found, it is used at 
once in the immediately following equations. By continuing the iteration 
as outlined above, we get the following table: 



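|       | x      | y       | z       | t         |
|-------|--------|---------|---------|-----------|
| $I_1$ | 0.503  | -0.414  | 0.0143  | -0.00435  |
| $I_2$ | 0.580  | -0.430  | 0.0179  | -0.00441  |
| $I_3$ | 0.5834 | -0.4307 | 0.01806 | -0.004413 |
| $I_4$ | 0.5835 | -0.4307 | 0.01806 | -0.004413 |
| $I_5$ | 0.5835 | -0.4307 | 0.01806 | -0.004413 |

The solution of the system (3) is thus

$$x = 0.5835, \quad y = -0.4307, \quad z = 0.01806, \quad t = -0.004413.$$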

After one or two iterations in an example to which the method of iteration
is applicable, the changes in the computed values of the unknowns
should be gradual. Erratic changes will usually mean that an error of
some kind has been made in obtaining the erratic value, and therefore the
value should be checked before proceeding further.

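103. Conditions for the Convergence of the Iteration Process. In Art.
88 we derived the conditions for the convergence of the iteration process
for a system of two equations of any nature. Those results can be
extended to systems in any number of unknowns. For example, in the
case of the system in four unknowns

$$
\begin{aligned}
x &= F_1(x, y, z, t)\\
y &= F_2(x, y, z, t)\\
z &= F_3(x, y, z, t)\\
t &= F_4(x, y, z, t),
\end{aligned}
\tag{1}
$$

the conditions for convergence are

$$\left|\frac{\partial F_1}{\partial x}\right| + \left|\frac{\partial F_2}{\partial x}\right| + \left|\frac{\partial F_3}{\partial x}\right| + \left|\frac{\partial F_4}{\partial x}\right| < 1$$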

and three similar inequalities involving partial derivatives with respect to 
y, z, and t, respectively. On adding the four inequalities and grouping 
the terms of the resultant inequality in the vertical direction, we readily 
see that the latter inequality is satisfied by the four conditions 


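$$
\begin{aligned}
\left|\frac{\partial F_1}{\partial x}\right| + \left|\frac{\partial F_1}{\partial y}\right| + \left|\frac{\partial F_1}{\partial z}\right| + \left|\frac{\partial F_1}{\partial t}\right| &< 1\\
\left|\frac{\partial F_2}{\partial x}\right| + \left|\frac{\partial F_2}{\partial y}\right| + \left|\frac{\partial F_2}{\partial z}\right| + \left|\frac{\partial F_2}{\partial t}\right| &< 1\\
\left|\frac{\partial F_3}{\partial x}\right| + \left|\frac{\partial F_3}{\partial y}\right| + \left|\frac{\partial F_3}{\partial z}\right| + \left|\frac{\partial F_3}{\partial t}\right| &< 1\\
\left|\frac{\partial F_4}{\partial x}\right| + \left|\frac{\partial F_4}{\partial y}\right| + \left|\frac{\partial F_4}{\partial z}\right| + \left|\frac{\partial F_4}{\partial t}\right| &< 1.
\end{aligned}
\tag{2}
$$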


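Writing the system

$$
\begin{aligned}
a_1x + b_1y + c_1z + d_1t &= e_1\\
a_2x + b_2y + c_2z + d_2t &= e_2\\
a_3x + b_3y + c_3z + d_3t &= e_3\\
a_4x + b_4y + c_4z + d_4t &= e_4
\end{aligned}
$$

in the iteration form

$$
\begin{aligned}
x &= \frac{1}{a_1}(e_1 - b_1y - c_1z - d_1t)\\
y &= \frac{1}{b_2}(e_2 - a_2x - c_2z - d_2t)\\
z &= \frac{1}{c_3}(e_3 - a_3x - b_3y - d_3t)\\
t &= \frac{1}{d_4}(e_4 - a_4x - b_4y - c_4z),
\end{aligned}
$$

we have by (1)

$$
\begin{aligned}
F_1 &= \frac{1}{a_1}(e_1 - b_1y - c_1z - d_1t)\\
F_2 &= \frac{1}{b_2}(e_2 - a_2x - c_2z - d_2t)\\
F_3 &= \frac{1}{c_3}(e_3 - a_3x - b_3y - d_3t)\\
F_4 &= \frac{1}{d_4}(e_4 - a_4x - b_4y - c_4z).
\end{aligned}
$$

Hence

$$
\begin{aligned}
\frac{\partial F_1}{\partial x} &= 0, & \frac{\partial F_1}{\partial y} &= -\frac{b_1}{a_1}, & \frac{\partial F_1}{\partial z} &= -\frac{c_1}{a_1}, & \frac{\partial F_1}{\partial t} &= -\frac{d_1}{a_1},\\
\frac{\partial F_2}{\partial x} &= -\frac{a_2}{b_2}, & \frac{\partial F_2}{\partial y} &= 0, & \frac{\partial F_2}{\partial z} &= -\frac{c_2}{b_2}, & \frac{\partial F_2}{\partial t} &= -\frac{d_2}{b_2},
\end{aligned}
$$

etc.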

Substituting into (2) these values of the partial derivatives, we have

$$\frac{|b_1| + |c_1| + |d_1|}{|a_1|} < 1, \quad\text{or}\quad |b_1| + |c_1| + |d_1| < |a_1|,$$

$$\frac{|a_2| + |c_2| + |d_2|}{|b_2|} < 1, \quad\text{or}\quad |a_2| + |c_2| + |d_2| < |b_2|,$$

etc.

Hence for a system of linear equations the sufficient conditions for the
convergence of the iteration process are given by the following simple rule:

The process of iteration will converge if in each equation of the system
the absolute value of the largest coefficient is greater than the sum of the
absolute values of all the remaining coefficients in that equation.
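The rule just stated is easy to apply mechanically. The sketch below is one possible check, with a hypothetical function name `iteration_rule_holds`; it tests the rule exactly as worded (largest coefficient in each equation against the sum of the others) and does not verify that the large coefficients belong to different unknowns, which the iteration form also assumes. The first matrix is that of Example 2; the second is inferred from the iteration form of Example 1.

```python
def iteration_rule_holds(A):
    """Return True if, in every equation (row of A), the largest coefficient
    in absolute value exceeds the sum of the absolute values of the others."""
    for row in A:
        magnitudes = [abs(c) for c in row]
        largest = max(magnitudes)
        if largest <= sum(magnitudes) - largest:
            return False
    return True


example_2 = [[3.122, 0.5756, -0.1565, -0.0067],
             [0.5756, 2.938, 0.1103, -0.0015],
             [-0.1565, 0.1103, 4.127, 0.2051],
             [-0.0067, -0.0015, 0.2051, 4.133]]
example_1 = [[27.0, 6.0, -1.0],
             [6.0, 15.0, 2.0],
             [1.0, 1.0, 54.0]]
not_dominant = [[1.0, 1.0],
                [1.0, -1.0]]   # made-up system that fails the rule

results = [iteration_rule_holds(m) for m in (example_1, example_2, not_dominant)]
```

The condition is sufficient, not necessary, so a system that fails this check may still converge under iteration.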



This rule can be applied at a glance and is seen to hold for the two
examples worked above.

Remarks. In the preceding pages several standard methods of solving
systems of linear equations have been explained in sufficient detail to enable 
the reader to see the advantages and disadvantages of each. No one method 
can be called the best for any and all systems of equations that may arise. 
The method of iteration is probably the best method where it is applicable, 
provided the convergence is reasonably rapid. It is the best because it is 
self-correcting and is applicable to systems of any number of unknowns. 
The best of the remaining methods are the modified Gauss method of Art. 
96 and the method by inversion of matrices (Art. 101). 


104. Errors in the Solutions when the Coefficients and Constant Terms
are Subject to Errors. In systems of linear equations occurring in applied 
mathematics the coefficients and constant terms are often subject to errors 
due to rounding or to uncertainties in experimental data. The solutions 
obtained from such systems will therefore be inaccurate to some extent. 
It is not possible to determine the exact magnitude of the errors in the 
solutions in such cases, but it is possible to determine the upper limits of 
the magnitudes of the errors. A method of determining the upper limits 
will be explained in this article. 

Let us consider the simple system

$$
\begin{aligned}
a_1x + a_2y + a_3z &= a_4\\
b_1x + b_2y + b_3z &= b_4\\
c_1x + c_2y + c_3z &= c_4,
\end{aligned}
\tag{1}
$$

where the coefficients and constant terms are subject to the errors $\Delta a_1$,
$\Delta a_2$, etc.

If the exact values of the coefficients and constant terms are $a_1 + \Delta a_1$,
$a_2 + \Delta a_2$, etc., and the corresponding true values of x, y, and z are $x + \Delta x$,
$y + \Delta y$, and $z + \Delta z$, the system (1) becomes

$$
\begin{aligned}
(a_1 + \Delta a_1)(x + \Delta x) + (a_2 + \Delta a_2)(y + \Delta y) + (a_3 + \Delta a_3)(z + \Delta z) &= a_4 + \Delta a_4\\
(b_1 + \Delta b_1)(x + \Delta x) + (b_2 + \Delta b_2)(y + \Delta y) + (b_3 + \Delta b_3)(z + \Delta z) &= b_4 + \Delta b_4\\
(c_1 + \Delta c_1)(x + \Delta x) + (c_2 + \Delta c_2)(y + \Delta y) + (c_3 + \Delta c_3)(z + \Delta z) &= c_4 + \Delta c_4.
\end{aligned}
\tag{2}
$$

Performing the indicated multiplications in the left-hand members, neglecting
all terms containing products of errors (such as $\Delta a_1 \Delta x$, etc.), and
then subtracting the equations in (1) from the corresponding equations
obtained from (2), we get

$$
\begin{aligned}
a_1\Delta x + a_2\Delta y + a_3\Delta z &= \Delta a_4 - (x\Delta a_1 + y\Delta a_2 + z\Delta a_3) = h_1, \text{ say}\\
b_1\Delta x + b_2\Delta y + b_3\Delta z &= \Delta b_4 - (x\Delta b_1 + y\Delta b_2 + z\Delta b_3) = h_2\\
c_1\Delta x + c_2\Delta y + c_3\Delta z &= \Delta c_4 - (x\Delta c_1 + y\Delta c_2 + z\Delta c_3) = h_3.
\end{aligned}
\tag{3}
$$

Note that equations (3) are merely the differentials of the corresponding
equations in (1) when the coefficients and unknowns are all regarded as
variables. Note further that the matrix of the coefficients in (3) is the
same as that in (1).

The right-hand members of (3) are known quantities, since x, y, and z
have already been found and the magnitudes of $\Delta a_1$, $\Delta a_2$, etc., are assumed
to be known. Hence we may regard the right members as constants and
denote them by $h_1$, $h_2$, $h_3$ as indicated. Our problem now is to find upper
limits for the magnitudes of the errors $\Delta x$, $\Delta y$, $\Delta z$.

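If we solve (3) for $\Delta x$, for example, we have

$$
\Delta x = \frac{\begin{vmatrix} h_1 & a_2 & a_3\\ h_2 & b_2 & b_3\\ h_3 & c_2 & c_3 \end{vmatrix}}{\begin{vmatrix} a_1 & a_2 & a_3\\ b_1 & b_2 & b_3\\ c_1 & c_2 & c_3 \end{vmatrix}}.
$$

Let q denote the largest of the h’s, and let us assume for the moment
that all the h’s are equal, so that

$$
\Delta x = q\,\frac{\begin{vmatrix} 1 & a_2 & a_3\\ 1 & b_2 & b_3\\ 1 & c_2 & c_3 \end{vmatrix}}{\begin{vmatrix} a_1 & a_2 & a_3\\ b_1 & b_2 & b_3\\ c_1 & c_2 & c_3 \end{vmatrix}}.
$$

Here it is obvious that the magnitude of $\Delta x$ varies directly with the
magnitude of q, and that the upper limit of q will give the upper limit
of $\Delta x$.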

As the signs of the errors $\Delta a_1$, $\Delta a_2$, etc. are not known, we are at liberty
to assign to them any signs we please. We therefore assign to them such
signs as will make all terms in the right-hand members of (3) positive.
Let $\eta$ be the upper limit of these errors (that is, none exceed $\eta$ in magnitude).
Then

$$|\Delta a_i| \le \eta, \quad |\Delta b_i| \le \eta, \quad |\Delta c_i| \le \eta, \qquad i = 1, 2, 3, 4$$

and

$$q = (1 + |x| + |y| + |z|)\,\eta. \tag{6}$$

To find the upper limits of the magnitudes of $\Delta x$, $\Delta y$, and $\Delta z$ we therefore
replace the right-hand members of (3) by the quantity q as given
by (6). Then the resulting system

$$
\begin{aligned}
a_1\Delta x + a_2\Delta y + a_3\Delta z &= (1 + |x| + |y| + |z|)\,\eta\\
b_1\Delta x + b_2\Delta y + b_3\Delta z &= (1 + |x| + |y| + |z|)\,\eta\\
c_1\Delta x + c_2\Delta y + c_3\Delta z &= (1 + |x| + |y| + |z|)\,\eta
\end{aligned}
\tag{7}
$$

is to be solved by any method which will guarantee that the right-hand
members of any reduced systems will be as large as possible. (Solutions
by determinants and matrices will not meet this requirement.) The best
method of solution is the modified Gauss method of Art. 96.

In eliminating the variables one at a time by that method (or any
similar method), the left-hand members of the equations are subtracted
as usual; but since the signs of the errors $\Delta a_1$, $\Delta b_1$, etc. are unknown, the
right-hand members of the equations must be added arithmetically so as to
guarantee that those members will always be as large as possible. We
now illustrate the method by a simple example.

Example. Consider the system

$$
\begin{aligned}
1.22x - 1.32y + 3.96z &= 2.12\\
2.12x - 3.52y + 1.62z &= -1.26\\
4.23x - 1.21y + 1.09z &= 3.22,
\end{aligned}
\tag{a}
$$

wherein all numbers are rounded and correct to the number of digits given.
Hence $\eta = 0.005$.

The solution of the above system is $x = 0.94385$, $y = 1.22724$, $z = 0.65365$.
Hence

$$q = (1 + 0.94385 + 1.22724 + 0.65365) \times 0.005 = 0.0191237.$$

The system for the errors $\Delta x$, $\Delta y$, $\Delta z$ is therefore

$$
\begin{aligned}
1.22\,\Delta x - 1.32\,\Delta y + 3.96\,\Delta z &= 0.0191237\\
2.12\,\Delta x - 3.52\,\Delta y + 1.62\,\Delta z &= 0.0191237\\
4.23\,\Delta x - 1.21\,\Delta y + 1.09\,\Delta z &= 0.0191237.
\end{aligned}
\tag{b}
$$

We solve (b) by the second method explained in Art. 96. Taking the
third of equations (b) as the pivotal equation and dividing it throughout
by 4.23, we have

$$\Delta x - 0.286052\,\Delta y + 0.257683\,\Delta z = 0.00452097. \tag{c}$$

Now multiplying (c) throughout by 2.12 and 1.22 in succession, we obtain

$$
\begin{aligned}
2.12\,\Delta x - 0.606430\,\Delta y + 0.546288\,\Delta z &= 0.00958446\\
1.22\,\Delta x - 0.348983\,\Delta y + 0.314374\,\Delta z &= 0.00551558.
\end{aligned}
$$

Subtracting the left members of these equations from the left members
of the corresponding equations in (b), but adding the right-hand members
in each case, we obtain the reduced system

$$
\begin{aligned}
-2.913570\,\Delta y + 1.073712\,\Delta z &= 0.0287082\\
-0.971017\,\Delta y + 3.645626\,\Delta z &= 0.0246393.
\end{aligned}
\tag{d}
$$

Taking the first of these two equations as pivotal equation and dividing
throughout by $-2.913570$, we have

$$\Delta y - 0.368521\,\Delta z = 0.00985326. \tag{e}$$

Now multiplying (e) throughout by $-0.971017$ and subtracting the
resulting equation from the second equation of (d) (but adding the right-hand
members), we get

$$3.287787\,\Delta z = 0.0342070,$$

from which

$$\Delta z = 0.010404.$$


Values for $\Delta y$ and $\Delta x$ can be found by either of two methods: (1) by
back substitution into (e) and (c), respectively, or (2) by starting with
the given system (b) and solving for each error separately and independently
of the others, as was done in finding $\Delta z$. If the method of back
substitution is used, all terms that are transposed to the right-hand members
of (e) and (c) must be added arithmetically to the term already on that
side. In the case of a system containing several unknowns, such continued
adding of terms to the right members will give unnecessarily large values
for the last errors found. In other words, the method does not give equal
weight to all determinations. The second method is longer, but it gives
equal weight to all determinations and is the correct method to use.

To find $\Delta y$, for example, we write the given system (b) in the form

$$
\begin{aligned}
3.96\,\Delta z + 1.22\,\Delta x - 1.32\,\Delta y &= 0.0191237\\
1.62\,\Delta z + 2.12\,\Delta x - 3.52\,\Delta y &= 0.0191237\\
1.09\,\Delta z + 4.23\,\Delta x - 1.21\,\Delta y &= 0.0191237,
\end{aligned}
$$

take the first as pivotal equation, divide it throughout by 3.96, and thus
obtain

$$\Delta z + 0.308081\,\Delta x - 0.333333\,\Delta y = 0.00482922.$$

The procedure from this point onward is exactly the same as in finding $\Delta z$.
$\Delta x$ is found by the same procedure as that just indicated for finding $\Delta y$.

Note that in finding any particular unknown the given system of
equations is written so that the desired unknown is in the last term in
the left-hand members.

The values found for $\Delta x$, $\Delta y$, and $\Delta z$ by separate determinations as outlined
above are:

$$\Delta x = 0.0087, \quad \Delta y = 0.0141, \quad \Delta z = 0.0104.$$

In view of the fact that in finding the errors in the solutions as shown
above, all errors were found by dividing the right-hand member of a simple
equation of two terms by the coefficient of the error in that equation (e.g.,
$3.287787\,\Delta z = 0.034207$, or $\Delta z = 0.034207/3.287787$), it is plain that by
keeping the right-hand members of all equations as large as possible we get
the maximum values of the errors. This is the reason for adding arithmetically
all quantities in the right-hand members.
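The separate determinations described above can be mechanized. The following sketch is an illustration under stated assumptions, not the procedure of Art. 96 verbatim: the function name `max_error`, the cyclic reordering of the columns so that the wanted unknown comes last, and the choice of the pivotal equation as the one with the largest leading coefficient are all assumptions, chosen because they reproduce the pivots used in the worked example. Left-hand members are subtracted as usual; right-hand members are added arithmetically.

```python
def max_error(A, q, k):
    """Upper limit for the error in unknown k of the error system A * delta = q."""
    n = len(A)
    order = [(k + 1 + j) % n for j in range(n)]   # rotate columns: unknown k goes last
    rows = [[row[c] for c in order] + [q] for row in A]
    for col in range(n - 1):
        # pivotal equation: largest coefficient (in magnitude) of the unknown to eliminate
        p = max(range(col, n), key=lambda r: abs(rows[r][col]))
        rows[col], rows[p] = rows[p], rows[col]
        pivot = rows[col]
        for r in range(col + 1, n):
            m = rows[r][col] / pivot[col]
            for c in range(col, n):
                rows[r][c] -= m * pivot[c]        # left members: subtract as usual
            rows[r][n] = abs(rows[r][n]) + abs(m * pivot[n])   # right members: add
    return abs(rows[n - 1][n] / rows[n - 1][n - 1])


A = [[1.22, -1.32, 3.96],
     [2.12, -3.52, 1.62],
     [4.23, -1.21, 1.09]]
eta = 0.005                                       # all data rounded to two decimals
x, y, z = 0.94385, 1.22724, 0.65365               # solution quoted in the example
q = (1 + abs(x) + abs(y) + abs(z)) * eta
dx, dy, dz = (max_error(A, q, k) for k in range(3))
```

Run on the example, this gives limits that round to the values 0.0087, 0.0141, and 0.0104 quoted in the text.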


EXERCISES XII

1. Evaluate 

2—316 
1—243 
— 1 8 2 4 

8—121 


by the pivotal method.

2. Evaluate the above determinant by the triangular method.

3. Solve for z by Cramer’s Rule:

* — + — < = 10 
+ <= 4 

y + 4f— 8<= 7 
6« + 2y— 3« + 2<= 9. 

4. Solve the following system by the Gauss method of Art. 96 : 

2.38Xi + 1.96X,— 3.27a;. + 1.68a;« = 2.16 
3.21a;i **** 0.86a;. 2.42w. 3.20a;4 3.28 
1.44xi -I- 2.9&r,— 2.1Ar, + 1.86«« = 1.42 
4.17a;, + 3.62x, — 1.68a;, — 2.26x« = 6.21. 

5. Evaluate the determinant of the coefficients in the above exercise.
Hint: Use the product of the leading coefficients of the pivotal equations.

6. Solve Exercise 3 completely by inverting the matrix of the coefficients.

7. Solve Exercise 4 completely by inverting the matrix of the coefficients. 

8. Solve the following system by the iteration process: 

0.89x + 4.32y — 0.47z + 0.96< = 3.36 
1.13X— 0.89y + 0.61s + 6.63< = 4.27 
6.32x—0.73y— 0.65s + 1.06< = 2.95 
0.74X + 1.01y + 6.28s —0.88< = 1.97. 

9. In the following system of equations the coefficients and constant
terms are correct to the number of digits given, but no farther. Solve the
system and find the possible errors in x, y, and z.

8.16X— 1.96y + 8.86s = 12.96 
2.18X + 6.12y — 2J9s = — 8.61 
6.92X + 8.06y + 2.16s = 6.88. 

10. The solution of the system 

2.82X— 3.96y + Alls + 8.86t( = ~8.08 
8.09X— 1.97y + 6.23s + 6.17w = — 1.18 
4.91x + 7.88y + 9.13« + 9.74tf= 8.69 

lJ4x—9.86y— 2.89s -~7J18tt= 2.16. 



Art. 104 ] 


EXERCISES 


« = 0.35949666, y = 0.44485660 
< = 0.71214877, « = — 1.12208255. 

If the coefficients and constant terms are correct only to the number of 
digits given, find the possible errors in x, y, z, and u. 



CHAPTER XIII 


THE NUMERICAL SOLUTION OF ORDINARY 
DIFFERENTIAL EQUATIONS 

I. EQUATIONS OF THE FIRST ORDER. 

105. Introduction. Certain types of differential equations are dealt with
in textbooks on calculus and differential equations, and methods are
developed for solving equations of the types treated. Comparatively few
differential equations, however, can be integrated in finite form. But just
as there are methods for finding to any desired degree of accuracy the roots
of any algebraic or transcendental equation having numerical coefficients,
so likewise there are methods for finding to any desired degree of accuracy
the numerical solution of any ordinary differential equation having
numerical coefficients and given initial conditions. Starting with the
initial values, the solutions are thence constructed by short steps ahead for
equal intervals $\Delta x = h$ of x, each step usually being checked by some
method before proceeding to the next step. The most important of the
several methods for solving differential equations numerically will be
explained in the following pages.

106. Euler’s Method and Its Modification. The oldest and simplest 
method, but also the crudest, was devised by Euler. A differential equation 
of the first order may be written in the symbolic form 

$$\frac{dy}{dx} = f(x, y). \tag{1}$$

The integral of (1) gives y as a function of x, which may be written
symbolically as

$$y = F(x). \tag{2}$$

The graph of (2) is a curve in the xy-plane; and since a smooth curve
is practically straight for a short distance from any point on it, we have
the approximate relation (see Fig. 13)

$$\Delta y \approx \Delta x \tan\theta = \left(\frac{dy}{dx}\right)_0 \Delta x,$$

so that

$$y_1 \approx y_0 + \left(\frac{dy}{dx}\right)_0 \Delta x.$$


Then the values of y corresponding to $x_2\,(= x_1 + h)$, $x_3\,(= x_2 + h)$, etc.
are found in the same way:

$$y_2 \approx y_1 + \left(\frac{dy}{dx}\right)_1 h, \qquad y_3 \approx y_2 + \left(\frac{dy}{dx}\right)_2 h, \qquad \text{etc.}$$

[Fig. 13]

By taking h small enough and proceeding in this manner, we could tabulate
the integral of (1) as a set of corresponding values of x and y. Such was
the method of Euler, but it is either too slow (in case h is small) or too
inaccurate (in case h is not small) for practical use. Even if h is taken
very small for all steps, it is evident from the figure and other
considerations that the computed y’s will deviate farther and farther from the
true y’s so long as the curvature of the graph does not change. These
considerations have led to a modification of Euler’s method, as shown below.
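The growing deviation can be seen numerically. The sketch below applies the unmodified Euler step to the equation $dy/dx = x + y$, $y(0) = 1$, which is the test equation used for the worked examples later in this article, and compares with its exact solution $y = 2e^x - x - 1$. The function name `euler` and the choice of ten steps are assumptions made only for illustration.

```python
import math


def euler(f, x0, y0, h, steps):
    """Unmodified Euler method: each step follows the tangent at its starting point."""
    xs, ys = [x0], [y0]
    for _ in range(steps):
        ys.append(ys[-1] + h * f(xs[-1], ys[-1]))
        xs.append(xs[-1] + h)
    return xs, ys


def f(x, y):
    return x + y


def exact(x):
    return 2.0 * math.exp(x) - x - 1.0


xs, ys = euler(f, 0.0, 1.0, 0.05, 10)
errors = [exact(x) - y for x, y in zip(xs, ys)]   # true value minus computed value
```

For this equation the curvature keeps the same sign throughout, and the entries of `errors` increase steadily from one step to the next.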

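Starting with the initial value $y_0$, an approximate value for $y_1$ is computed
from the relation

$$y_1^{(1)} = y_0 + \left(\frac{dy}{dx}\right)_0 h.$$

Then this approximate value of $y_1$ is substituted into the given equation
(1) to get an approximate value of $\frac{dy}{dx}$ at the end of the first interval, or

$$\left(\frac{dy}{dx}\right)_1^{(1)} = f\left(x_1, y_1^{(1)}\right).$$

Then an improved value of $\Delta y$ is found by multiplying h by the average
(mean) of the values of $\frac{dy}{dx}$ at the ends of the interval $x_0$ to $x_1$, or

$$\Delta y = \frac{h}{2}\left[\left(\frac{dy}{dx}\right)_0 + \left(\frac{dy}{dx}\right)_1^{(1)}\right].$$

That this value of $\Delta y$ is more accurate than the value $\left(\frac{dy}{dx}\right)_0 h$ is evident
if we think of $\frac{dy}{dx}$ as the rate of change of y with respect to x. The second
approximation for $y_1$ is now

$$y_1^{(2)} = y_0 + \frac{h}{2}\left[\left(\frac{dy}{dx}\right)_0 + \left(\frac{dy}{dx}\right)_1^{(1)}\right].$$

This improved value $y_1^{(2)}$ is now substituted into the given equation
(1) to get a second approximation for $\frac{dy}{dx}$, or

$$\left(\frac{dy}{dx}\right)_1^{(2)} = f\left(x_1, y_1^{(2)}\right).$$

The third approximation for $y_1$ is then

$$y_1^{(3)} = y_0 + \frac{h}{2}\left[\left(\frac{dy}{dx}\right)_0 + \left(\frac{dy}{dx}\right)_1^{(2)}\right].$$

The process is repeated until no change is produced in the value of $y_1$
to the number of digits retained.

The computation for the next interval $x_1$ to $x_2$ is carried
out in exactly the same manner, by first finding an approximate value of
$\Delta y$ and then applying the averaging process until no improvement is
made in $y_2$.

That this modification of the Euler method gives a great improvement
in accuracy over the original method can be seen by a glance at Fig. 14.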



[Fig. 14]

In this figure the $\Delta y$ computed by the Euler method is represented by KM.
If PN is drawn parallel to the tangent at Q, the $\Delta y$ computed by using the
slope at Q is represented by KN. On the other hand, if we take the
average of the slopes we get

$$\Delta y = \frac{1}{2}\left[h\left(\frac{dy}{dx}\right)_0 + h\left(\frac{dy}{dx}\right)_1\right] = \tfrac{1}{2}(KM + KN) = \tfrac{1}{2}(KM + KM + MN) = KM + \tfrac{1}{2}MN,$$

which is very close to its true value KQ. The attainable accuracy in any
case is limited by the length of the step h.

Although the modified Euler method is slow and of limited accuracy, 
its simplicity and applicability make it a method of great value; for it enables 
one to start the solution of problems where no other method will work. 

First approximations to $y_2$, $y_3$, etc. could be found by means of
the formula

$$y_{n+1} = y_n + h\,y'_n,$$

as was done above; but as soon as two consecutive values of y are known,
the first approximations to succeeding y’s can be found more accurately
from the formula

$$y_{n+1} = y_{n-1} + 2h\,y'_n. \tag{3}$$

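To derive this formula, let the function y be represented in the neighborhood
of $x_n$ by Taylor’s series. Then

$$f(x_n + h) = f(x_n) + h f'(x_n) + \frac{h^2}{2} f''(x_n) + \frac{h^3}{3!} f'''(x_n) + \cdots, \tag{a}$$

$$f(x_n - h) = f(x_n) - h f'(x_n) + \frac{h^2}{2} f''(x_n) - \frac{h^3}{3!} f'''(x_n) + \cdots. \tag{b}$$

Subtracting (b) from (a), we get

$$f(x_n + h) - f(x_n - h) = 2h f'(x_n) + \frac{h^3}{3} f'''(x_n) + \frac{h^5}{60} f^{\mathrm{v}}(x_n) + \cdots. \tag{c}$$

In terms of y, (a) and (c) may be written

$$y_{n+1} = y_n + h\,y'_n + \frac{h^2}{2}\,y''_n + \cdots, \tag{d}$$

$$y_{n+1} = y_{n-1} + 2h\,y'_n + \frac{h^3}{3}\,y'''_n + \cdots. \tag{e}$$

When h is small and only the first two terms in the right-hand members
of (d) and (e) are used, the truncation errors are $\frac{h^2}{2}y''_n$ and $\frac{h^3}{3}y'''_n$,
respectively, and the latter is much smaller than the former. Hence (e)
gives a more accurate value of $y_{n+1}$.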
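The difference between the two first approximations can be checked on a case where the true values are known. The sketch below uses the equation $dy/dx = x + y$, $y(0) = 1$, with exact solution $y = 2e^x - x - 1$ and $h = 0.05$; feeding exact values of $y_{n-1}$ and $y_n$ into both formulas is an assumption made purely to isolate the truncation error of each.

```python
import math


def exact(x):
    return 2.0 * math.exp(x) - x - 1.0


h = 0.05
y_prev = exact(0.0)                 # y_{n-1}
y_curr = exact(h)                   # y_n
slope = h + y_curr                  # y'_n = x_n + y_n for this equation

one_step = y_curr + h * slope       # y_{n+1} = y_n + h y'_n
two_step = y_prev + 2 * h * slope   # y_{n+1} = y_{n-1} + 2h y'_n

err_one = abs(exact(2 * h) - one_step)
err_two = abs(exact(2 * h) - two_step)
```

Here `err_one` is close to $h^2 y''/2$ and `err_two` is close to $h^3 y'''/3$, so the second is roughly thirty times smaller, in line with the leading truncation terms.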

The first approximations to y found from (3) are to be corrected and 
improved by the averaging process described above. 

The principal part of the error in the final value of y can be found 
as follows: 

Since the increment in y for each step is obtained from the formula

$$\Delta y = h\,\frac{y'_n + y'_{n+1}}{2},$$

the right-hand member of which has the form of the first group of terms
in Euler’s quadrature formula, the principal part of the error in $\Delta y$ is

$$-\frac{h^2}{12}\left[f''(x_{n+1}) - f''(x_n)\right] = -\frac{h^3}{12} f'''(\xi)$$

by the theorem of mean value, where $x_n < \xi < x_{n+1}$.



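As an example of the use of the modified Euler method, we compute a
few values of y for the differential equation

$$\frac{dy}{dx} = x + y$$

with the initial conditions $x_0 = 0$, $y_0 = 1$.

Substituting these values of x and y in the given equation, we have

$$\left(\frac{dy}{dx}\right)_0 = x_0 + y_0 = 0 + 1 = 1.$$

Taking $h = 0.05$, we then have

$$y_1^{(1)} = y_0 + \left(\frac{dy}{dx}\right)_0 h = 1 + 0.05 = 1.05.$$

Then

$$\left(\frac{dy}{dx}\right)_1^{(1)} = x_1 + y_1^{(1)} = 0.05 + 1.05 = 1.10.$$

The second approximation to $y_1$ is therefore

$$y_1^{(2)} = y_0 + \frac{\left(\frac{dy}{dx}\right)_0 + \left(\frac{dy}{dx}\right)_1^{(1)}}{2}\,h = 1 + \frac{1 + 1.10}{2} \times 0.05 = 1.0525.$$

The second approximation for $\left(\frac{dy}{dx}\right)_1$ is then

$$\left(\frac{dy}{dx}\right)_1^{(2)} = 0.05 + 1.0525 = 1.1025.$$

Then the third approximation to $y_1$ is

$$y_1^{(3)} = 1 + \frac{1 + 1.1025}{2} \times 0.05 = 1.05256.$$

Continuing the approximation, we have

$$\left(\frac{dy}{dx}\right)_1^{(3)} = 0.05 + 1.05256 = 1.10256,$$

$$y_1^{(4)} = 1 + \frac{1 + 1.10256}{2} \times 0.05 = 1.05256.$$

Since this is the same as $y_1^{(3)}$, we can get no further change in y by
continuing the approximations. We therefore take

$$y_1 = 1.0526, \qquad \left(\frac{dy}{dx}\right)_1 = 1.1026.$$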


As a first approximation to $y_2$ we have, by (3),

$$y_2^{(1)} = 1 + 0.1(1.1026) = 1.1103.$$

Hence

$$\left(\frac{dy}{dx}\right)_2^{(1)} = 0.1 + 1.1103 = 1.2103.$$

Then

$$y_2^{(2)} = 1.0526 + 0.05\,\frac{1.1026 + 1.2103}{2} = 1.1104,$$

and

$$\left(\frac{dy}{dx}\right)_2^{(2)} = 0.1 + 1.1104 = 1.2104.$$

Hence

$$y_2^{(3)} = 1.0526 + \frac{0.05\,(1.1026 + 1.2104)}{2} = 1.1104,$$

which is the same as $y_2^{(2)}$. We therefore take

$$y_2 = 1.1104, \qquad \left(\frac{dy}{dx}\right)_2 = 1.2104.$$

Collecting our results in tabular form, we have the following table:

| x    | y      | dy/dx  |
|------|--------|--------|
| 0.00 | 1.0000 | 1.0000 |
| 0.05 | 1.0526 | 1.1026 |
| 0.10 | 1.1104 | 1.2104 |


The question now arises as to the accuracy of the results found above. 
Fortunately, the exact analytical solution of the given equation, with the 
stated initial conditions, is easily found to be 

$$y = 2e^x - x - 1.$$

For x = 0, 0.05, 0.10, the corresponding values of y are 1, 1.05254, and
1.11034. The values in the table above can be improved only by taking a
smaller value for h.
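The whole computation can be sketched as a short program. The function name `modified_euler`, the tolerance, and the iteration cap are assumptions made for illustration. The sketch always starts each step from the plain Euler estimate, whereas the text switches to formula (3) once two values are known; the starting estimate affects only the number of corrections needed, not the value at which the averaging process settles.

```python
def modified_euler(f, x0, y0, h, steps, tol=1e-10, max_iter=50):
    """Modified Euler method: repeat the averaging of end slopes until y stops changing."""
    xs, ys = [x0], [y0]
    for _ in range(steps):
        x, y = xs[-1], ys[-1]
        slope0 = f(x, y)
        y_new = y + h * slope0                      # first approximation
        for _ in range(max_iter):
            # improved value: h times the mean of the slopes at the two ends
            y_next = y + h * (slope0 + f(x + h, y_new)) / 2.0
            converged = abs(y_next - y_new) < tol
            y_new = y_next
            if converged:
                break
        xs.append(x + h)
        ys.append(y_new)
    return xs, ys


xs, ys = modified_euler(lambda x, y: x + y, 0.0, 1.0, 0.05, 2)
```

Since the program does not round to four decimals at each stage, its values differ from the hand computation in the fifth decimal place, but they round to the tabulated 1.0526 and 1.1104.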


107. Picard’s Method of Successive Approximations. From the
equation

$$\frac{dy}{dx} = f(x, y),$$

we have

$$dy = f(x, y)\,dx.$$

Integrating this between corresponding limits for x and y, we have

$$\int_{y_0}^{y} dy = \int_{x_0}^{x} f(x, y)\,dx = \int_{x_0}^{x} \left(\frac{dy}{dx}\right) dx,$$

from which

$$y = y_0 + \int_{x_0}^{x} f(x, y)\,dx = y_0 + \int_{x_0}^{x} \left(\frac{dy}{dx}\right) dx. \tag{1}$$

Here the integral term in the right-hand member represents the increment
in y produced by an increment $x - x_0$ in x.

Confining our attention for the moment to the first form in (1), namely,

$$y = y_0 + \int_{x_0}^{x} f(x, y)\,dx,$$

we notice that the equation is complicated by the presence of y under the
integral sign as well as outside it. An equation of this kind is called an
integral equation and can be solved by a process of successive approximations,
or iteration, if the indicated integrations can be performed in the
successive steps.

To solve the differential equation

$$\frac{dy}{dx} = f(x, y)$$

by Picard’s method of successive approximations, we get a first approximation
for y by putting $y_0$ for y in the integrand of (1). Then

$$y^{(1)} = y_0 + \int_{x_0}^{x} f(x, y_0)\,dx.$$

The integrand is now a function of x alone and the indicated integration
can be performed, in theory at least. Having now a first approximation to y,
we substitute it for y in the integrand of (1) and integrate again, thus
obtaining a second approximation

$$y^{(2)} = y_0 + \int_{x_0}^{x} f\left(x, y^{(1)}\right) dx.$$

The process is repeated in this way as many times as may be necessary or
desirable, the nth approximation being given by the equation

$$y^{(n)} = y_0 + \int_{x_0}^{x} f\left(x, y^{(n-1)}\right) dx.$$

We now apply this method to the simple example

$$\frac{dy}{dx} = x + y,$$

with the initial conditions $x_0 = 0$, $y_0 = 1$.



To get a first approximation we substitute y = 1 in the right-hand member
of the given equation, thus obtaining

$$y^{(1)} = 1 + \int_0^x (x + 1)\,dx = \frac{x^2}{2} + x + 1.$$

For second and third approximations we have

$$y^{(2)} = 1 + \int_0^x \left(x + \frac{x^2}{2} + x + 1\right) dx = \frac{x^3}{6} + x^2 + x + 1,$$

$$y^{(3)} = 1 + \int_0^x \left(x + \frac{x^3}{6} + x^2 + x + 1\right) dx = \frac{x^4}{24} + \frac{x^3}{3} + x^2 + x + 1.$$

We have thus found y as a power series in x. For x = 0.1 we have

$$y = \frac{0.0001}{24} + \frac{0.001}{3} + 0.01 + 0.1 + 1 = 1.1103.$$

This value of y is correct to four decimal places, as shown at the end of Art. 106.
For x = 0.2 the corresponding value of $y^{(3)}$ is 1.2427, whereas the true
value is 1.2428. We could get a better value by continuing the approximations
to $y^{(4)}$, $y^{(5)}$, etc.; but it is better to move up to the point x = 0.1 and
start all over again.
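For this particular equation every approximation is a polynomial, so the successive integrations can be carried out exactly on lists of coefficients. The sketch below is tied to $f(x, y) = x + y$ with $x_0 = 0$; the helper names `picard_step` and `evaluate` are assumptions, and exact fractions are used so that the coefficients can be compared directly with those found above.

```python
from fractions import Fraction


def picard_step(y_poly, y0):
    """One Picard step for dy/dx = x + y starting at x0 = 0.

    Polynomials are lists of coefficients in ascending powers of x.
    """
    integrand = list(y_poly) + [Fraction(0)] * max(0, 2 - len(y_poly))
    integrand[1] += 1                     # add the x term of f(x, y) = x + y
    # integrate term by term from 0 to x, then add the initial value y0
    integral = [Fraction(0)] + [c / (k + 1) for k, c in enumerate(integrand)]
    integral[0] += y0
    return integral


def evaluate(poly, x):
    return sum(c * x ** k for k, c in enumerate(poly))


y = [Fraction(1)]                         # starting function y = y0 = 1
approximations = []
for _ in range(3):
    y = picard_step(y, Fraction(1))
    approximations.append(y)

y_at_01 = float(evaluate(approximations[2], Fraction(1, 10)))
y_at_02 = float(evaluate(approximations[2], Fraction(2, 10)))
```

The third list holds the coefficients 1, 1, 1, 1/3, 1/24 of the third approximation, and its values at x = 0.1 and x = 0.2 round to 1.1103 and 1.2427.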

The graphs of $y^{(1)}$, $y^{(2)}$, $y^{(3)}$, and $y = F(x)$ are shown in Fig. 15. It will
be seen that the approximating curves approach the curve $y = F(x)$ more
closely with each successive approximation.

Now taking x = 0.1 and y = 1.1103 as initial values, we have

$$y^{(1)} = 1.1103 + \int_{0.1}^{x} (x + 1.1103)\,dx = \frac{x^2}{2} + 1.1103x + 0.9943.$$

Then for second and third approximations we get

$$y^{(2)} = 1.1103 + \int_{0.1}^{x} \left(x + \frac{x^2}{2} + 1.1103x + 0.9943\right) dx = \frac{x^3}{6} + 1.0552x^2 + 0.9943x + 1.0001,$$

$$y^{(3)} = 1.1103 + \int_{0.1}^{x} \left(x + \frac{x^3}{6} + 1.0552x^2 + 0.9943x + 1.0001\right) dx = \frac{x^4}{24} + 0.3517x^3 + 0.9972x^2 + 1.0001x + 1.0000.$$

For x = 0.2 we get y = 1.2428, which is correct to four decimal places.

We could now move up to the point x = 0.2 and start over again; but
since this method is not much used in practice, we shall not continue the
computation by this method.

[Fig. 15]

The practical difficulties associated with the method as outlined above lie
mostly in the difficult and sometimes impossible integrations which would
often have to be performed many times over. For example, if we wished
to solve the equation $dy/dx = (y - x)/(y + x)$ with the initial conditions
$x_0 = 0$, $y_0 = 1$, we should have

$$y^{(1)} = 1 + \int_0^x \frac{1 - x}{1 + x}\,dx = 1 + 2\ln(1 + x) - x,$$

$$y^{(2)} = 1 + \int_0^x \frac{1 + 2\ln(1 + x) - x - x}{1 + 2\ln(1 + x) - x + x}\,dx,$$

and our troubles would continue to pile up as we continued the approximations.
The difficulties would be far greater in other examples which might
come up for solution. Fortunately, such difficulties, and indeed all direct
integrations, can be avoided by the methods to be explained in the next
two articles.


108. Use of Approximating Polynomials. We avoid the difficulties just
mentioned by replacing $\frac{dy}{dx}$ by a polynomial and then integrating this

polynomial over any desired interval. The appropriate polynomial for 
this purpose is that given by Newton’s formula (II), because in the 
numerical integration of differential equations we always start with given 
values (initial conditions) and construct the solution from that point 
onward. Hence the values of the function immediately behind us are 
always known, but the values ahead are unknown. The problem is always 
to find the next value ahead. 


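Writing $y'$ in place of $\frac{dy}{dx}$ and replacing y by $y'$ in formula (II), p. 60,
we have

$$
\begin{aligned}
y' &= y'_n + u\,\Delta_1 y'_n + \frac{u(u+1)}{2}\,\Delta_2 y'_n + \frac{u(u+1)(u+2)}{3!}\,\Delta_3 y'_n + \frac{u(u+1)(u+2)(u+3)}{4!}\,\Delta_4 y'_n\\
&= y'_n + \Delta_1 y'_n\,u + \frac{\Delta_2 y'_n}{2}(u^2 + u) + \frac{\Delta_3 y'_n}{6}(u^3 + 3u^2 + 2u) + \frac{\Delta_4 y'_n}{24}(u^4 + 6u^3 + 11u^2 + 6u),
\end{aligned}
\tag{1}
$$

where

$$u = \frac{x - x_n}{h}, \quad\text{or}\quad x = x_n + hu.$$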

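Since the change in y for any interval is given by the formula

$$\Delta y = \int_{x_k}^{x_{k+1}} \frac{dy}{dx}\,dx,$$

we can find by means of (1) the change in y over any interval where
$dy/dx$ is continuous. We therefore have for any interval $x_{k+1} - x_k$

$$\Delta y = \int_{x_k}^{x_{k+1}} \left[y'_n + \Delta_1 y'_n\,u + \frac{\Delta_2 y'_n}{2}(u^2 + u) + \frac{\Delta_3 y'_n}{6}(u^3 + 3u^2 + 2u) + \frac{\Delta_4 y'_n}{24}(u^4 + 6u^3 + 11u^2 + 6u)\right] dx.$$

Since $x = x_n + hu$, we have $dx = h\,du$. Substituting this value for dx
above and changing limits, we get

$$\Delta y = h\int_{u_k}^{u_{k+1}} \left[y'_n + \Delta_1 y'_n\,u + \frac{\Delta_2 y'_n}{2}(u^2 + u) + \frac{\Delta_3 y'_n}{6}(u^3 + 3u^2 + 2u) + \frac{\Delta_4 y'_n}{24}(u^4 + 6u^3 + 11u^2 + 6u)\right] du,$$

or

$$\Delta y = h\left[y'_n u + \Delta_1 y'_n \frac{u^2}{2} + \frac{\Delta_2 y'_n}{2}\left(\frac{u^3}{3} + \frac{u^2}{2}\right) + \frac{\Delta_3 y'_n}{6}\left(\frac{u^4}{4} + u^3 + u^2\right) + \frac{\Delta_4 y'_n}{24}\left(\frac{u^5}{5} + \frac{3u^4}{2} + \frac{11u^3}{3} + 3u^2\right)\right]_{u_k}^{u_{k+1}}. \tag{2}$$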

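Let us now compute the value of $\Delta y$ for the intervals $x_{n+1} - x_n$, $x_n - x_{n-1}$,
$x_{n-1} - x_{n-2}$, etc. by substituting in (2) the proper limits for u. For the
interval $x_{n+1} - x_n$ the limits for u are

$$u_{n+1} = (x_{n+1} - x_n)/h = h/h = 1, \qquad u_n = (x_n - x_n)/h = 0.$$

On substituting these in (2) and simplifying, we get

$$\Delta y = I_n^{n+1} = h\left[y'_n + \tfrac{1}{2}\Delta_1 y'_n + \tfrac{5}{12}\Delta_2 y'_n + \tfrac{3}{8}\Delta_3 y'_n + \tfrac{251}{720}\Delta_4 y'_n\right].$$

For the interval $x_n - x_{n-1}$ the limits for u are

$$u_n = 0, \qquad u_{n-1} = (x_{n-1} - x_n)/h = -1,$$

and therefore

$$I_{n-1}^{n} = h\left[y'_n - \tfrac{1}{2}\Delta_1 y'_n - \tfrac{1}{12}\Delta_2 y'_n - \tfrac{1}{24}\Delta_3 y'_n - \tfrac{19}{720}\Delta_4 y'_n\right].$$

Proceeding in the same way for the other intervals, we get formulas
for the changes in y in those intervals. The results for the several
intervals are:

$$I_n^{n+1} = h\left[y'_n + \tfrac{1}{2}\Delta_1 y'_n + \tfrac{5}{12}\Delta_2 y'_n + \tfrac{3}{8}\Delta_3 y'_n + \tfrac{251}{720}\Delta_4 y'_n\right], \tag{108.1}$$

$$I_{n-1}^{n} = h\left[y'_n - \tfrac{1}{2}\Delta_1 y'_n - \tfrac{1}{12}\Delta_2 y'_n - \tfrac{1}{24}\Delta_3 y'_n - \tfrac{19}{720}\Delta_4 y'_n\right], \tag{108.2}$$

$$I_{n-2}^{n-1} = h\left[y'_n - \tfrac{3}{2}\Delta_1 y'_n + \tfrac{5}{12}\Delta_2 y'_n + \tfrac{1}{24}\Delta_3 y'_n + \tfrac{11}{720}\Delta_4 y'_n\right], \tag{108.3}$$

$$I_{n-3}^{n-2} = h\left[y'_n - \tfrac{5}{2}\Delta_1 y'_n + \tfrac{23}{12}\Delta_2 y'_n - \tfrac{3}{8}\Delta_3 y'_n - \tfrac{19}{720}\Delta_4 y'_n\right], \tag{108.4}$$

$$I_{n-4}^{n-3} = h\left[y'_n - \tfrac{7}{2}\Delta_1 y'_n + \tfrac{53}{12}\Delta_2 y'_n - \tfrac{55}{24}\Delta_3 y'_n + \tfrac{251}{720}\Delta_4 y'_n\right]. \tag{108.5}$$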
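The numerical coefficients in these formulas can be checked by exact integration. In the sketch below, the coefficient of the k-th difference over an interval from $u = a$ to $u = b$ is taken to be $\frac{1}{k!}\int_a^b u(u+1)\cdots(u+k-1)\,du$, which is what term-by-term integration of the backward-difference polynomial gives. The function names are assumptions, and rational arithmetic is used so that the fractions come out exactly.

```python
from fractions import Fraction
from math import factorial


def rising_product(k):
    """Coefficients (ascending powers of u) of u(u+1)...(u+k-1); equals 1 for k = 0."""
    poly = [Fraction(1)]
    for j in range(k):
        shifted = [Fraction(0)] * (len(poly) + 1)
        for i, c in enumerate(poly):
            shifted[i] += c * j          # multiply by the constant part of (u + j)
            shifted[i + 1] += c          # multiply by u
        poly = shifted
    return poly


def difference_coefficient(k, a, b):
    """Integral from a to b of u(u+1)...(u+k-1)/k! du, as an exact fraction."""
    poly = rising_product(k)

    def antiderivative(u):
        return sum(c * Fraction(u) ** (i + 1) / (i + 1) for i, c in enumerate(poly))

    return (antiderivative(b) - antiderivative(a)) / factorial(k)


ahead = [difference_coefficient(k, 0, 1) for k in range(5)]     # interval n to n+1
behind = [difference_coefficient(k, -1, 0) for k in range(5)]   # interval n-1 to n
```

The list `ahead` gives the coefficients for integrating one step ahead of $x_n$, and `behind` gives those for the interval just behind it; changing the limits to $(-2, -1)$, $(-3, -2)$, or $(-4, -3)$ gives the coefficients for the earlier intervals.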



In the application of formulas (108.1) and (108.2) the coefficients $\frac{251}{720}$
and $\frac{19}{720}$ may be replaced by $\frac{1}{3}$ and $\frac{1}{40}$, respectively.

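By adding (108.1) and (108.2) and then (108.2) and (108.3), we get
the following additional formulas:

$$
\begin{aligned}
I_{n-1}^{n+1} &= h\left[2y'_n + \tfrac{1}{3}\Delta_2 y'_n + \tfrac{1}{3}\Delta_3 y'_n + \tfrac{29}{90}\Delta_4 y'_n\right]\\
&= 2h\left[y'_n + \tfrac{1}{6}\left(\Delta_2 y'_n + \Delta_3 y'_n + \Delta_4 y'_n\right) - \tfrac{1}{180}\Delta_4 y'_n\right],
\end{aligned}
\tag{108.6}
$$

$$I_{n-2}^{n} = h\left[2y'_n - 2\Delta_1 y'_n + \tfrac{1}{3}\Delta_2 y'_n - \tfrac{1}{90}\Delta_4 y'_n\right]. \tag{108.7}$$

Replacing the first and second differences in (108.7) by their values in
terms of the $y'$’s, namely

$$\Delta_1 y'_n = y'_n - y'_{n-1}, \qquad \Delta_2 y'_n = y'_n - 2y'_{n-1} + y'_{n-2},$$

and simplifying, we may write (108.7) in the equivalent form

$$I_{n-2}^{n} = \frac{h}{3}\left(y'_n + 4y'_{n-1} + y'_{n-2}\right) - \frac{h}{90}\Delta_4 y'_n, \tag{108.8}$$

which we recognize at once as Simpson’s Rule with its remainder term.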

Instead of using (108.1) for computing the first approximation to $\Delta y$
in the next step ahead and checking it by (108.2), we may use explicit
formulas for the approximate and corrected values of the next y ahead.
Since

$$I_n^{n+1} = y_{n+1} - y_n,$$

we may replace $I_n^{n+1}$ by this value in (108.1) and obtain the formula

$$\bar{y}_{n+1} = y_n + h\left[y'_n + \tfrac{1}{2}\Delta_1 y'_n + \tfrac{5}{12}\Delta_2 y'_n + \tfrac{3}{8}\Delta_3 y'_n + \tfrac{251}{720}\Delta_4 y'_n\right], \tag{108.9}$$

a bar being placed over $y_{n+1}$ to indicate that it is a first approximation
obtained by extrapolation. Then when (108.2) is applied to the new
interval, the n and n − 1 in that formula become n + 1 and n, respectively.
Hence the corrected y at the forward end of the new interval is given by
the formula

$$y_{n+1} = y_n + h\left[\bar{y}'_{n+1} - \tfrac{1}{2}\Delta_1 \bar{y}'_{n+1} - \tfrac{1}{12}\Delta_2 \bar{y}'_{n+1} - \tfrac{1}{24}\Delta_3 \bar{y}'_{n+1} - \tfrac{19}{720}\Delta_4 \bar{y}'_{n+1}\right], \tag{108.10}$$

where $\bar{y}'_{n+1}$ is the value of $y'$ obtained from the given equation when the y
therein is replaced by the $\bar{y}_{n+1}$ found from (108.9).

In like manner, when (108.6) and (108.8) are replaced by explicit
formulas for y, we get

$$\bar{y}_{n+1} = y_{n-1} + 2h\left[y'_n + \tfrac{1}{6}\left(\Delta_2 y'_n + \Delta_3 y'_n + \Delta_4 y'_n\right) - \tfrac{1}{180}\Delta_4 y'_n\right], \tag{108.11}$$

$$y_{n+1} = y_{n-1} + \frac{h}{3}\left(\bar{y}'_{n+1} + 4y'_n + y'_{n-1}\right) - \frac{h}{90}\Delta_4 \bar{y}'_{n+1}, \tag{108.12}$$

respectively, the $\bar{y}'_{n+1}$ being found from the given equation as above.

Now a word as to the use of the foregoing formulas. Formulas (108.1),
(108.6), (108.9), and (108.11) are formulas for integrating ahead by
extrapolation. They are used to start a new line in the computation and
are used only once in each step-interval.

Formulas (108.2), (108.7), (108.8), (108.10), and (108.12) are used
for checking and correcting the extrapolated values found by the formulas
for integrating ahead. They may be used more than once in any step-interval.

The intervals covered by these formulas are shown graphically in Figure
16.

[Fig. 16: the points n-4, n-3, n-2, n-1, n, n+1 marked at equal intervals along a line]

Formulas (108.1) and (108.2) are the main tools which we shall use in
the numerical integration of ordinary differential equations. It is needless
to say that all the foregoing formulas apply equally well when the variables
are any quantities whatever: time and acceleration, time and velocity, etc.
Their use will be illustrated by several examples in the pages ahead.

Historical Note. The method of replacing the derivative of a function 
by a polynomial and integrating that polynomial over an interval was used 
by J. C. Adams as early as .* Adams derived formulas (108.1) and 
(108.9), but he did not use (108.1). He used (108.8) in the manner 
indicated on page 397 and then applied a correction formula. 





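Example. We return once more to the differential equation

$$\frac{dy}{dx} = x + y.$$

In Art. 106 we computed the entries in the following table, except that
now we have added on the columns of differences. (The second differences
are given in units of the fourth decimal place.)

| x    | y      | y'     | Δ₁y'    | Δ₂y' |
|------|--------|--------|---------|------|
| 0.00 | 1.0000 | 1.0000 |         |      |
| 0.05 | 1.0526 | 1.1026 | +0.1026 |      |
| 0.10 | 1.1104 | 1.2104 | +0.1078 | +52  |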

Before proceeding further with the computation we had better check
the values already found. If $x_n$ denotes the third value of x in the table,
then the second and first values will be $x_{n-1}$ and $x_{n-2}$, respectively. (See
Fig. 16.) To compute the increment in y for the first interval and thereby
find $y_1$ we apply formula (108.3), since it covers the interval $x_{n-1} - x_{n-2}$.
We therefore have

$$\Delta y = 0.05\left[1.2104 - \tfrac{3}{2}(0.1078) + \tfrac{5}{12}(0.0052)\right] = 0.05254,$$

$$y_1 = y_0 + \Delta y = 1.0525.$$

For the second interval we apply (108.2). Then

$$\Delta y = 0.05\left[1.2104 - \tfrac{1}{2}(0.1078) - \tfrac{1}{12}(0.0052)\right] = 0.05780.$$

The corrected values of y are therefore $y_1 = 1.0525$,
$y_2 = 1.0525 + 0.0578 = 1.1103$.

We now make a new table containing the corrected values for y, /, and 
the first and second differences of y'. We also insert in this table a column 
for Ay as a matter of convenience. 


    x       y        Δy        y'       Δ₁y'      Δ₂y'   Δ₃y'
    0.00    1.0000             1.0000
    0.05    1.0525   +0.0525   1.1025   +0.1025
    0.10    1.1103   +0.0578   1.2103   +0.1078   +53
    0.15    1.1736   +0.0633   1.3236   +0.1133   +55    +2
    0.20    1.2427   +0.0691   1.4427   +0.1191   +58    +3

The computation is continued by adding a new line to the above table,
the line for x = 0.15. The first step is to compute a new Δy by means of
formula (108.1), using the data of the third line:

$$\Delta y = 0.05\left[1.2103 + \tfrac{1}{2}(0.1078) + \tfrac{5}{12}(0.0053)\right] = 0.0633,$$
$$y^{(1)}_{0.15} = 1.1103 + 0.0633 = 1.1736.$$

Then

$$(y')^{(1)}_{0.15} = 0.15 + 1.1736 = 1.3236.$$

The next step is to enter these values of y and y' in the fourth line of the
table and then compute the differences of y', as shown in the table. The
entries in this line must now be checked and improved upon if possible
by means of formula (108.2). Thus,

$$\Delta y = 0.05\left[1.3236 - \tfrac{1}{2}(0.1133) - \tfrac{1}{12}(0.0055)\right] = 0.0633.$$

Since this is the same value for Δy as previously found, there is no possibility
of improving upon the results in the fourth line and we therefore take them
to be correct to four decimal places.
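The cycle just carried out for the line x = 0.15 (integrate ahead, then check) can be sketched in Python. This is only an illustration under stated assumptions: the coefficient lists are the backward-difference Adams coefficients that reproduce the arithmetic shown above for (108.1) and (108.2), and the helper names `backward_differences` and `increment` are hypothetical, not taken from the text.

```python
# Sketch of one "integrate ahead, then check" cycle for y' = x + y.
# Assumption: (108.1) and (108.2) are the backward-difference Adams formulas
# whose leading coefficients appear in the worked arithmetic of the example.
PREDICT = [1, 1 / 2, 5 / 12, 3 / 8, 251 / 720]      # applied at the line x_n
CORRECT = [1, -1 / 2, -1 / 12, -1 / 24, -19 / 720]  # applied at the line x_(n+1)


def backward_differences(values):
    """Return [v_n, D1 v_n, D2 v_n, ...] formed from the tabulated values."""
    row = list(values)
    diffs = [row[-1]]
    while len(row) > 1:
        row = [b - a for a, b in zip(row, row[1:])]
        diffs.append(row[-1])
    return diffs


def increment(h, derivs, coeffs):
    """Increment in y: h times the weighted sum of the available differences."""
    return h * sum(c * d for c, d in zip(coeffs, backward_differences(derivs)))


h = 0.05
xs = [0.00, 0.05, 0.10]
ys = [1.0000, 1.0525, 1.1103]
dys = [x + y for x, y in zip(xs, ys)]     # y' from the given equation

# Integrate ahead from the line x = 0.10.
dy_pred = increment(h, dys, PREDICT)
x_new = 0.15
y_new = round(ys[-1] + dy_pred, 4)

# Check, using the trial value of y' at x = 0.15.
dy_corr = increment(h, dys + [x_new + y_new], CORRECT)
print(round(dy_pred, 4), y_new, round(dy_corr, 4))
```

When the two increments agree to the number of places retained, the line is accepted; otherwise the check is repeated with the improved y'.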

The fifth line in the table is computed in exactly the same way and is 
found to be correct at the first trial. 

The fact that the correct values of y were found at the first trial in
lines four and five suggests that it may be expedient to double the interval
of integration, in order to progress more rapidly. We therefore take
h = 0.10 and make a new table with differences to correspond to the longer
interval.


    x     y        Δy        y'       Δ₁y'      Δ₂y'   Δ₃y'   Δ₄y'
    0.0   1.0000             1.0000
    0.1   1.1103   +0.1103   1.2103   +0.2103
    0.2   1.2427   +0.1324   1.4427   +0.2324   +221
    0.3   1.3995   +0.1568   1.6995   +0.2568   +244   +23
    0.3   1.3996   +0.1569   1.6996   +0.2569   +245   +24
    0.4   1.5835   +0.1839   1.9835   +0.2839   +270   +25    +1
    0.5   1.7973   +0.2138   2.2973   +0.3138   +299   +29    +4
    0.6   2.0441   +0.2468   2.6441   +0.3468   +330   +31    +2
    0.7   2.3274   +0.2833   3.0274   +0.3833   +365   +35    +4
    0.8   2.6510   +0.3236   3.4510   +0.4236   +403   +38    +3
    0.9   3.0191   +0.3681   3.9191   +0.4681   +445   +42    +4
    1.0   3.4364   +0.4173   4.4364   +0.5173   +492   +47    +5
    1.0   3.4365   +0.4174   4.4365   +0.5174   +493   +48    +6

(Second and higher differences are in units of the fourth decimal place.)


To start the line for x = 0.3, we first compute Δy by means of (108.1),
using the data in the line for x = 0.2. We have

$$\Delta y = 0.1\,[1.4427 + 0.1162 + 0.0092] = 0.1568.$$

Hence $y^{(1)}_{0.3} = 1.2427 + 0.1568 = 1.3995$, and $(y')^{(1)}_{0.3} = 1.6995$. We now
enter these values in the table and compute the differences for that line.
Checking these values by means of (108.2), we get

$$\Delta y = 0.1\,(1.6995 - 0.1284 - 0.0020 - 0.0001) = 0.1569.$$

Since this value of Δy is different from that previously found, we repeat
the line for x = 0.3 and write this value of Δy in the new line. The second
approximations for $y_{0.3}$ and $(y')_{0.3}$ are then

$$y^{(2)}_{0.3} = 1.2427 + 0.1569 = 1.3996,$$
$$(y')^{(2)}_{0.3} = 1.6996.$$

Entering these values in the new line, computing the corresponding
differences, and then applying formula (108.2) to the data of this line, we
have

$$\Delta y = 0.1\,(1.6996 - 0.1284 - 0.0020 - 0.0002) = 0.1569.$$

Since this is the same value for Δy as previously found, we consider the
results in this second line for x = 0.3 to be correct.

The computations are continued up to x = 1, as shown in the table.
It so happens that formula (108.1) gives the correct result for every line
except the last. Fourth differences were used in formula (108.1), but
not in (108.2).

Since the exact solution of the differential equation dy/dx = x + y, with
the initial conditions $x_0 = 0$, $y_0 = 1$, is

$$y = 2e^{x} - x - 1,$$

we can compute the exact value of y corresponding to any value of x. The
following table gives the correct values of y for values of x differing by
one tenth.

    x      y          x      y
    0      1          0.6    2.0442
    0.1    1.1103     0.7    2.3275
    0.2    1.2428     0.8    2.6511
    0.3    1.3997     0.9    3.0192
    0.4    1.5836     1.0    3.4366
    0.5    1.7974
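The comparison with the exact solution is easy to reproduce. The sketch below evaluates y = 2e^x - x - 1 and differences it against the values obtained above by numerical integration with h = 0.1; the dictionary `computed` simply restates those tabulated values, and the names are illustrative.

```python
import math


def exact_y(x):
    """Closed-form solution of y' = x + y with y(0) = 1."""
    return 2.0 * math.exp(x) - x - 1.0


# Values found by numerical integration with h = 0.1 (final lines of the table).
computed = {0.1: 1.1103, 0.2: 1.2427, 0.3: 1.3996, 0.4: 1.5835, 0.5: 1.7973,
            0.6: 2.0441, 0.7: 2.3274, 0.8: 2.6510, 0.9: 3.0191, 1.0: 3.4365}

# Error of each tabulated value, rounded to the fourth decimal place.
errors = {x: round(exact_y(x) - y, 4) for x, y in computed.items()}
for x, err in errors.items():
    print(f"x = {x:.1f}  exact = {exact_y(x):.4f}  error = {err:+.4f}")
```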





It will be noticed that the values found by numerical integration are
in error by one unit in the last decimal place, beginning with the value for
x = 0.2. The truth is that the source of these errors is in the value 1.2427,
which is in error by one unit in the last figure. This error was simply
carried on by addition throughout the table. To avoid such errors it is
necessary to have the first two or three lines in the table correct.

Note. There is another method for starting a new line in the table
without the use of formula (108.1). It consists in assuming that the highest
difference in the next line will be the same as in the line just finished, and
then working backwards by adding the new differences to the values in
the previous line. For example, suppose we take the line for x = 0.8 and
try to find the next line. We have

    x     y        Δy       y'         Δ₁y'       Δ₂y'    Δ₃y'
    0.8   2.6510   0.3236   3.4510     0.4236     403     38
    0.9                     (3.9187)   (0.4677)   (441)   (38)
    0.9   3.0191   0.3681   3.9191     0.4681     445     42

The first step in this procedure was to assume that the third difference
in the line for x = 0.9 was 0.0038, the same value as given in the line above.
Then we added this 0.0038 to the second difference 0.0403 in the line above.
This gave us a second difference for the new line. We added this 0.0441
to the first difference in the line above and obtained a new first difference
0.4677. This was then added to the previous y' to get the value 3.9187 for
y' in the new line.

The next step is to apply formula (108.2) to this new line, using the
quantities enclosed in parentheses (these quantities are enclosed in
parentheses to indicate that they are trial or assumed values). We thus get

$$\Delta y = 0.1\,(3.9187 - 0.2338 - 0.0037 - 0.0002) = 0.3681.$$

This value of Δy happens to be correct. We now add this to the previous
y to get the new value of y and thus complete the line. But now the new
y' must be computed by adding the value of x to this new y. We therefore
repeat the line for x = 0.9 and insert the correct values of all the quantities.
In some instances it would be necessary to correct this second line.

The method just outlined in this note is the one used by J. C. Adams and
F. R. Moulton. It is not as much trouble to apply as it may seem from the
description above, but nevertheless it requires more labor than the method
of integrating ahead by (108.1) and will therefore not be used in this book.


109. Methods of Starting the Solution. The determination of the first
few values of the function is the most important and usually the most
laborious part in the numerical solution of a differential equation. It is
the most important part because the first few values must be accurate to
the number of significant figures desired in the solution, and it is the most
laborious because the first few values are sometimes not easily found to
the desired accuracy.

Formulas (108.1) and (108.2) involve the first four differences of the
function y'. These differences can be constructed only when five
consecutive values of y' are known. The first value of y' can be found from the
given equation and the initial values of x and y; the remaining four can
be found by one or more of several methods, the most important of which
are the following:

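1. By Taylor's Series. If y = f(x), the familiar Taylor formula

$$f(x) = f(x_0) + f'(x_0)(x - x_0) + \frac{f''(x_0)}{2!}(x - x_0)^2 + \cdots \tag{1}$$

may be written in the less familiar form

$$y = y_0 + y'_0(x - x_0) + \frac{y''_0}{2!}(x - x_0)^2 + \frac{y'''_0}{3!}(x - x_0)^3 + \frac{y^{iv}_0}{4!}(x - x_0)^4 + \cdots, \tag{2}$$

where $x_0$ and $y_0$ denote the initial values of x and y. In finding y's by
formula (2), it is desirable to keep $|x - x_0|$ numerically small in order
to have rapid convergence of the series and therefore high accuracy in
the y's. Hence in general we should work on both sides of the point $x_0$:
that is, we should compute y's both to the right and to the left of the
point $x = x_0$. Using the notation $y_1 = f(x_0 + h)$, $y_2 = f(x_0 + 2h)$,
$y_{-1} = f(x_0 - h)$, $y_{-2} = f(x_0 - 2h)$, etc., we have from (2):

$$y_1 = y_0 + y'_0 h + \frac{y''_0 h^2}{2} + \frac{y'''_0 h^3}{3!} + \frac{y^{iv}_0 h^4}{4!} + \frac{y^{v}_0 h^5}{5!} + \cdots \tag{3}$$

$$y_2 = y_0 + y'_0(2h) + \frac{y''_0 (2h)^2}{2} + \frac{y'''_0 (2h)^3}{3!} + \frac{y^{iv}_0 (2h)^4}{4!} + \frac{y^{v}_0 (2h)^5}{5!} + \cdots \tag{4}$$

$$y_{-1} = y_0 - y'_0 h + \frac{y''_0 h^2}{2} - \frac{y'''_0 h^3}{3!} + \frac{y^{iv}_0 h^4}{4!} - \frac{y^{v}_0 h^5}{5!} + \cdots \tag{5}$$

$$y_{-2} = y_0 - y'_0(2h) + \frac{y''_0 (2h)^2}{2} - \frac{y'''_0 (2h)^3}{3!} + \frac{y^{iv}_0 (2h)^4}{4!} - \frac{y^{v}_0 (2h)^5}{5!} + \cdots \tag{6}$$

where h denotes the interval between the equidistant values of x.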



If the successive derivatives of the given differential equation y' = f(x, y)
are easily found, the five needed consecutive values of y' can be found from
the initial conditions, the given equation, and the formulas (3)-(6) above.

Example 1. Let the given differential equation be

$$\frac{dy}{dx} = y' = x + y, \quad \text{with } x_0 = 0,\ y_0 = 1.$$

Here

$$y'' = 1 + y', \quad y''' = y'', \quad y^{iv} = y''', \quad y^{v} = y^{iv}.$$

Hence

$$y'_0 = x_0 + y_0 = 1, \quad y''_0 = 1 + y'_0 = 2, \quad y'''_0 = y^{iv}_0 = y^{v}_0 = 2.$$

Now taking h = 0.1 and substituting in (3), (4), (5), (6), we get

$$y_1 = 1.1103, \quad y_2 = 1.2428, \quad y_{-1} = 0.9097, \quad y_{-2} = 0.8375.$$

These values are all correct to four decimal places. The five desired
consecutive values of y' are now found from the given equation to be:

$$y'_{-2} = x_{-2} + y_{-2} = -0.2 + 0.8375 = 0.6375$$
$$y'_{-1} = x_{-1} + y_{-1} = -0.1 + 0.9097 = 0.8097$$
$$y'_{0} = x_0 + y_0 = 1$$
$$y'_{1} = x_1 + y_1 = 0.1 + 1.1103 = 1.2103$$
$$y'_{2} = x_2 + y_2 = 0.2 + 1.2428 = 1.4428.$$

If the function y should be non-existent for values of x less than $x_0$,
then we compute y's only to the right of $x_0$ by substituting in (2) the
proper values of h.
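Example 1 can be reproduced with a few lines of Python. The sketch below sums the Taylor series (2) through the fifth-derivative term, using the derivative values found in the example; the function name `taylor_value` and the table layout are illustrative choices, not part of the text.

```python
from math import factorial


def taylor_value(derivs_at_x0, dx):
    """Sum y0 + y0'*dx + y0''*dx^2/2! + ... over the derivatives supplied."""
    return sum(d * dx**k / factorial(k) for k, d in enumerate(derivs_at_x0))


x0, h = 0.0, 0.1
# y0, y0', y0'', ..., y0^(v) for y' = x + y with y(0) = 1 (from Example 1).
derivs = [1.0, 1.0, 2.0, 2.0, 2.0, 2.0]

table = []
for k in (-2, -1, 0, 1, 2):
    x = x0 + k * h
    y = round(taylor_value(derivs, k * h), 4)
    table.append((x, y, round(x + y, 4)))   # y' from the given equation

for x, y, dy in table:
    print(f"x = {x:+.1f}   y = {y:.4f}   y' = {dy:.4f}")
```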


2. By Milne's Formulas. It frequently happens that the higher
derivatives of y' = f(x, y) cannot be found without excessive labor, or hardly
at all. In such cases the above method cannot be used for starting the
computation. If, however, the first derivative of y' = f(x, y), or y'', can
be found without difficulty, the five starting values of y' can be found by
certain formulas first used by W. E. Milne.* To derive these formulas
we need two additional Taylor series similar to (3) and (5).

* American Mathematical Monthly, Vol. 48 (1941), p. 52.

Representing the function y' in the neighborhood of $x = x_0$ by
Taylor's series, we have

$$y'_1 = y'_0 + y''_0 h + \frac{y'''_0 h^2}{2} + \frac{y^{iv}_0 h^3}{3!} + \frac{y^{v}_0 h^4}{4!} + \cdots \tag{7}$$

$$y'_{-1} = y'_0 - y''_0 h + \frac{y'''_0 h^2}{2} - \frac{y^{iv}_0 h^3}{3!} + \frac{y^{v}_0 h^4}{4!} - \cdots \tag{8}$$

Adding (7) and (8) and then subtracting (8) from (7), we get

$$y'_1 + y'_{-1} = 2y'_0 + y'''_0 h^2 + \frac{y^{v}_0 h^4}{12} + \cdots \tag{9}$$

$$y'_1 - y'_{-1} = 2y''_0 h + \frac{y^{iv}_0 h^3}{3} + \cdots \tag{10}$$

Now solve (9) for $y'''_0$ and (10) for $y^{iv}_0$, and then substitute these values in
(3) and (5). The results are

$$y_1 = y_0 + \frac{h}{24}\left(y'_{-1} + 16y'_0 + 7y'_1\right) + \frac{h^2}{4}y''_0 - \frac{h^5}{180}y^{v}_0 \tag{A}$$

$$y_{-1} = y_0 - \frac{h}{24}\left(7y'_{-1} + 16y'_0 + y'_1\right) + \frac{h^2}{4}y''_0 + \frac{h^5}{180}y^{v}_0 \tag{B}$$

These formulas give $y_1$ and $y_{-1}$ as soon as $y'_{-1}$ and $y'_1$ are known.

To find formulas giving $y_2$ and $y_{-2}$, substitute in (4) and (6) the values
of $y'''_0$ and $y^{iv}_0$ found from (9) and (10). The results are

$$y_2 = y_0 + \frac{2h}{3}\left(5y'_1 - y'_0 - y'_{-1}\right) - 2h^2 y''_0 + \frac{7h^5}{45}y^{v}_0 \tag{C}$$

$$y_{-2} = y_0 - \frac{2h}{3}\left(5y'_{-1} - y'_0 - y'_1\right) - 2h^2 y''_0 - \frac{7h^5}{45}y^{v}_0 \tag{D}$$

These formulas will give $y_2$ and $y_{-2}$ as soon as $y'_{-1}$ and $y'_1$ are known.

An additional formula is desirable for checking $y_2$ and $y_{-2}$ when found
from (C) and (D). Subtracting (B) from (A), we get

$$y_1 - y_{-1} = \frac{h}{3}\left(y'_{-1} + 4y'_0 + y'_1\right) - \frac{h^5}{90}y^{v}_0,$$

or

$$y_1 = y_{-1} + \frac{h}{3}\left(y'_{-1} + 4y'_0 + y'_1\right) - \frac{h^5}{90}y^{v}_0.$$

Since this formula holds for any interval of width 2h, we may write it as
a general formula

$$y_{n+1} = y_{n-1} + \frac{h}{3}\left(y'_{n-1} + 4y'_n + y'_{n+1}\right) - \frac{h^5}{90}y^{v}_n. \tag{E}$$

The quantity $\frac{h}{3}(y'_{n-1} + 4y'_n + y'_{n+1})$ is evidently Simpson's Rule and is
an approximation to the definite integral $\int_{x_n - h}^{x_n + h} y'\,dx$, which represents the
increment in y for the two intervals from $x_n - h$ to $x_n + h$.

In the application of formulas (A)-(E) the terms in $y^{v}$ are omitted.
The formulas as used are thus accurate up to and including fourth
differences.

It is to be noted that the second derivative y'' is to be evaluated only
at the one point $(x_0, y_0)$.

Concerning the use of the foregoing formulas, the first step is to
compute trial values of $y'_1$ and $y'_{-1}$ from the relations

$$y'_1 = y'_0 + hy''_0, \qquad y'_{-1} = y'_0 - hy''_0 \quad \text{(Euler method)}. \tag{F}$$

Then substitute these in (A) and (B) to get first approximations to $y_1$
and $y_{-1}$. These approximate values of $y_1$ and $y_{-1}$, with the corresponding
$x_1$ and $x_{-1}$, are then substituted into the given differential equation
y' = f(x, y) to get improved values for $y'_1$ and $y'_{-1}$, which in turn are
substituted back into (A) and (B) to get better values for $y_1$ and $y_{-1}$.
This iteration process is continued until no change is produced in $y'_1$
and $y'_{-1}$. To obtain high accuracy in $y'_{-1}$ and $y'_1$ the value of h must
be small.

Now having the three consecutive values $y'_{-1}$, $y'_0$, $y'_1$ to the desired
degree of accuracy, we substitute them in (C) and (D) to get approximate
values of $y_2$ and $y_{-2}$ by extrapolation. Then these, together with $x_2$ and
$x_{-2}$, are substituted into the given equation to get approximate values of $y'_2$
and $y'_{-2}$. These latter are then substituted into formula (E) to get
improved values of $y_2$ and $y_{-2}$. If these agree with the previous values
found by extrapolation, we take them as correct. Then these values when
substituted into the given equation will give correct values for $y'_2$ and $y'_{-2}$.
We thus obtain the five needed consecutive values of y' to give us the
various orders of differences to use in (108.1).

Note. After having found $y'_{-1}$, $y'_0$, and $y'_1$ to the desired degree of
accuracy, we could form first and second differences from these and then
proceed with formulas (108.1) and (108.2).

Example 2. To find five consecutive values of y' for starting the
solution of the equation

$$y' = \frac{xy}{x^2 + y^2}, \quad \text{with } y_0 = 1,\ x_0 = 0,$$

we have

$$y'' = \frac{(x^2 + y^2)(xy' + y) - xy(2x + 2yy')}{(x^2 + y^2)^2}.$$

Hence

$$y'_0 = 0, \quad y''_0 = 1.$$

Taking h = 0.1, we have from formulas (F)

$$y'_1 = 0 + 0.1(1) = 0.1, \qquad y'_{-1} = 0 - 0.1 = -0.1.$$

Then by (A) and (B),

$$y_1^{(1)} = 1 + \frac{0.1}{24}(-0.1 + 0.7) + \frac{(0.1)^2}{4} = 1.0050,$$

$$y_{-1}^{(1)} = 1 - \frac{0.1}{24}(-0.7 + 0.1) + \frac{(0.1)^2}{4} = 1.0050.$$

Now substituting these y's and the corresponding x's into the given
equation, we have

$$y'_1 = \frac{(0.1)(1.005)}{(0.1)^2 + (1.005)^2} = \frac{0.1005}{0.01 + 1.010025} = 0.09853,$$

$$y'_{-1} = -0.09853.$$

On substituting these into (A) and (B), we get

$$y_1^{(2)} = 1 + \frac{0.1}{24}(-0.09853 + 0.68971) + 0.0025 = 1.00496,$$

$$y_{-1}^{(2)} = 1.00496.$$

Now substituting these into the given equation, we have

$$y'_1 = \frac{(0.1)(1.00496)}{(0.1)^2 + (1.00496)^2} = \frac{0.100496}{1.01994} = 0.09853,$$

$$y'_{-1} = -0.09853.$$

As these are the same values as previously found for $y'_1$ and $y'_{-1}$, we take
them to be correct.

Having "dug in," as it were, about the point $(x_0, y_0)$ and found three
consecutive values of y' to the desired degree of accuracy, we next find
$y_{-2}$ and $y_2$ by extrapolating backward and forward by means of formulas
(C) and (D). From (C) and (D) we have

$$y_2 = 1 + \frac{0.2}{3}(0.59118) - 0.02 = 1.0194,$$

$$y_{-2} = 1 - \frac{0.2}{3}(-0.59118) - 0.02 = 1.0194.$$

These values must now be checked by formula (E) after first finding
$y'_2$ and $y'_{-2}$ from the given equation. Substituting in the given equation
the value of $x_2$ (= 2h = 0.2) and the value of $y_2$ found above, we have

$$y'_2 = \frac{0.2(1.0194)}{(0.2)^2 + (1.0194)^2} = \frac{0.20388}{1.07918} = 0.18892.$$

Likewise, for $x_{-2}$ (= -2h = -0.2) and the value of $y_{-2}$ found above,
we get

$$y'_{-2} = \frac{-0.2(1.0194)}{(-0.2)^2 + (1.0194)^2} = -0.18892.$$

Then from formula (E),

$$y_2 = y_0 + \frac{h}{3}\left(y'_0 + 4y'_1 + y'_2\right) = 1 + \frac{0.1}{3}(0 + 4 \times 0.09853 + 0.18892) = 1.0194.$$

For checking the backward value $y_{-2}$ we write formula (E) in the
transposed form

$$y_{n-1} = y_{n+1} - \frac{h}{3}\left(y'_{n-1} + 4y'_n + y'_{n+1}\right).$$

Hence we have

$$y_{-2} = y_0 - \frac{h}{3}\left(y'_{-2} + 4y'_{-1} + y'_0\right) = 1 - \frac{0.1}{3}(-0.18892 - 4 \times 0.09853) = 1.0194.$$

Since these values are the same as those found by extrapolation, we
consider them to be correct.

Now having five correct consecutive values for y and y', we can form
differences up to and including the fourth for these quantities and proceed
with the numerical solution by means of formulas (108.1) and (108.2).
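The starting procedure of Example 2 can be sketched as a short routine. The code below is an illustration only: it applies (F), iterates (A) and (B) until y' at the two neighboring points stops changing, extrapolates by (C) and (D), and checks by (E), with the terms in the fifth derivative omitted as in the text. The function name `milne_start`, the tolerance, and the iteration limit are assumptions of the sketch.

```python
def f(x, y):
    """Right-hand side of Example 2: y' = xy / (x^2 + y^2)."""
    return x * y / (x * x + y * y)


def milne_start(f, x0, y0, d2y0, h, tol=5e-6, max_iter=25):
    """Five starting values of y by (F) and (A)-(E); d2y0 is y'' at (x0, y0)."""
    d0 = f(x0, y0)
    dp, dm = d0 + h * d2y0, d0 - h * d2y0                 # (F): Euler trial values
    for _ in range(max_iter):
        yp = y0 + h / 24 * (dm + 16 * d0 + 7 * dp) + h * h / 4 * d2y0   # (A)
        ym = y0 - h / 24 * (7 * dm + 16 * d0 + dp) + h * h / 4 * d2y0   # (B)
        dp_new, dm_new = f(x0 + h, yp), f(x0 - h, ym)
        done = abs(dp_new - dp) < tol and abs(dm_new - dm) < tol
        dp, dm = dp_new, dm_new
        if done:
            break
    yp2 = y0 + 2 * h / 3 * (5 * dp - d0 - dm) - 2 * h * h * d2y0        # (C)
    ym2 = y0 - 2 * h / 3 * (5 * dm - d0 - dp) - 2 * h * h * d2y0        # (D)
    # Check the extrapolated values by Simpson's rule, formula (E).
    yp2_chk = y0 + h / 3 * (d0 + 4 * dp + f(x0 + 2 * h, yp2))
    ym2_chk = y0 - h / 3 * (f(x0 - 2 * h, ym2) + 4 * dm + d0)
    return (ym2, ym, y0, yp, yp2), (ym2_chk, yp2_chk)


values, checks = milne_start(f, x0=0.0, y0=1.0, d2y0=1.0, h=0.1)
print([round(v, 4) for v in values], [round(c, 4) for c in checks])
```

If the checked values differed from the extrapolated ones in the last place retained, the checked values would be taken and y' recomputed, as described in the text.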

At this point it is instructive to compare these computed values of y and
y' with their exact values. The homogeneous equation $y' = \dfrac{xy}{x^2 + y^2}$ can
readily be solved by the usual artifice of putting y = vx. The solution,
with the given initial conditions, is found to be

$$x^2 = y^2 \ln y^2.$$

The Newton-Raphson method, when applied to this equation, shows that
$y_1$ should be corrected by the amount 0.000003 and $y_2$ by the amount
0.00003. The values previously found are thus true to the number of
significant figures retained in the computation.
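A minimal sketch of that Newton-Raphson check follows. It solves x^2 = y^2 ln y^2 for y at x = 0.1 and x = 0.2, starting from the computed values; the function name `exact_y`, the tolerance, and the iteration limit are illustrative assumptions.

```python
import math


def exact_y(x, y_guess, tol=1e-12, max_iter=50):
    """Solve x^2 = y^2 ln(y^2) for y by Newton-Raphson, starting at y_guess."""
    y = y_guess
    for _ in range(max_iter):
        g = y * y * math.log(y * y) - x * x        # residual of the implicit solution
        dg = 2.0 * y * (math.log(y * y) + 1.0)     # derivative of the residual in y
        step = g / dg
        y -= step
        if abs(step) < tol:
            break
    return y


y1 = exact_y(0.1, 1.00496)
y2 = exact_y(0.2, 1.0194)
print(y1 - 1.00496, y2 - 1.0194)   # corrections to the computed values
```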

3. By the Runge-Kutta Method. The use of this method will be
explained in a subsequent article.

4. By the Modified Euler Method. This method has been explained in
Article 106. It can be used for starting the numerical solution of any
ordinary differential equation and is used when the previously-explained
methods cannot be used to advantage.

After the five consecutive starting values have been found, the numerical
solution of a differential equation is continued as far as desired by means of
formulas (108.1) and (108.2). This part of the solution is mostly smooth
sailing. If the differences higher than the second become negligible, it will
be well to double the interval h. When this is done, a new table of differences
must be constructed for the wider interval, using the previously-computed
values of y' for this purpose.

If, on the other hand, the fourth differences should become large, or if 
several trial computations should be required to obtain the correct result, 

or if the term — should equal or e.xceed half a unit in the last decimal 
3 

place (or in the last significant figure retained in the computation), then
the interval h should be reduced by half and the computation continued
with the shorter interval as far as may seem necessary. The best way of
reducing the interval is explained below.

110. Halving the Interval for h. Since the process of reducing the
interval is the same as beginning a new solution of the given equation, it is
absolutely necessary that five consecutive values of y' be known accurately
in the region where the new computation is to begin. The best way to find
accurate values at the midpoints of old intervals is to use Bessel's formula
for interpolating to halves, namely:

$$y'_{1/2} = \frac{y'_0 + y'_1}{2} - \frac{1}{8}\cdot\frac{\Delta^2 y'_{-1} + \Delta^2 y'_0}{2} + \frac{3}{128}\cdot\frac{\Delta^4 y'_{-2} + \Delta^4 y'_{-1}}{2}, \tag{110.1}$$

where $y'_{1/2}$ is the value of y' halfway between $y'_0$ and $y'_1$. Note that the
differences used in this formula are ordinary diagonal differences. It will
be necessary to find the value of y' at two midintervals in order to have five
consecutive values for starting the computation with halved intervals.

Example. The numerical solution of the equation y' = x + y is
tabulated on page 325 from x = 0 to x = 1. Suppose it were desired to reduce
the interval by half from the point x = 0.6 onward.

Solution. We take x = 0.6 as the zero point and then rewrite the
differences of the y' values in diagonal-difference form, as shown below:
















Then for x = 0.65, we have

$$y'_{0.65} = \frac{2.6441 + 3.0274}{2} - \frac{1}{8}\cdot\frac{0.0365 + 0.0403}{2} + \frac{3}{128}\cdot\frac{0.0003 + 0.0004}{2} = 2.8358 - 0.0048 = 2.8310,$$

and

$$y'_{0.75} = \frac{3.0274 + 3.4510}{2} - \frac{1}{8}\cdot\frac{0.0403 + 0.0445}{2} = 3.2339.$$

The five starting values for the new table, considering only y', would be 
as shown below: 


    x      y'       Δ₁y'      Δ₂y'   Δ₃y'   Δ₄y'
    0.60   2.6441
    0.65   2.8310   +0.1869
    0.70   3.0274   +0.1964    95
    0.75   3.2339   +0.2065   101     6
    0.80   3.4510   +0.2171   106     5     -1
    0.85   3.6792   +0.2282   111     5      0
    0.90   3.9191   +0.2399   117     6      1


Continuing the computations, we apply (108.1) to the line for x = 0.80
and get

$$\Delta y = 0.05\,[3.4510 + 0.1086 + 0.0044 + 0.0002] = 0.1782.$$

Then

$$y_{0.85} = 2.6510 + 0.1782 = 2.8292.$$

Hence

$$y'_{0.85} = 0.85 + 2.8292 = 3.6792.$$

We next fill in the line for x = 0.85 and apply (108.2) to this line as a
check. We have

$$\Delta y = 0.05\,[3.6792 - 0.1141 - 0.0009] = 0.1782,$$

which is the same as before.

We continue the table through the line x = 0.90 to see if the new schedule
with halved intervals gives the values previously found. Applying (108.1)
to the line for x = 0.85, we have

$$\Delta y = 0.05\,[3.6792 + 0.1141 + 0.0046 + 0.0002] = 0.1899.$$

Hence

$$y_{0.90} = 2.8292 + 0.1899 = 3.0191.$$

Then

$$y'_{0.90} = 0.90 + 3.0191 = 3.9191.$$

These are the same values as previously found before the interval was halved,
and they indicate that no error was introduced in changing to the shorter
interval. It is to be noted that the shorter interval reduces the fourth
differences to insignificance.
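Bessel's formula for interpolating to halves is simple enough to check by machine. The sketch below evaluates (110.1) for the two midpoints used in the example; the function name `bessel_to_halves` and its argument names are hypothetical, and the fourth-difference arguments default to zero when those differences are neglected.

```python
def bessel_to_halves(v0, v1, d2_m1, d2_0, d4_m2=0.0, d4_m1=0.0):
    """Value midway between v0 and v1 by Bessel's formula (110.1).

    d2_m1, d2_0 are the second diagonal differences and d4_m2, d4_m1 the
    fourth diagonal differences that straddle the interval.
    """
    return (v0 + v1) / 2 - (d2_m1 + d2_0) / 16 + 3 * (d4_m2 + d4_m1) / 256


# y' at x = 0.65 and x = 0.75 from the table with h = 0.1.
dy_065 = bessel_to_halves(2.6441, 3.0274, 0.0365, 0.0403, 0.0003, 0.0004)
dy_075 = bessel_to_halves(3.0274, 3.4510, 0.0403, 0.0445)
print(round(dy_065, 4), round(dy_075, 4))
```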


EXERCISES XIII


1. Obtain by the modified Euler method five consecutive starting values 
for the numerical solution of 


dx 



9 


with Xo = 20, yo = 3. Check the starting values by formulas (108. 2) to 
(108. 6) and then add two more lines to your table. 

2. Obtain by Taylor's series five consecutive starting values for the
numerical solution of

$$\frac{dy}{dx} = 2x - y,$$

with $x_0 = 1$, $y_0 = 3$. Check the values and then add three more lines to
the table.

Compare your results with those obtained from the exact analytical
solution $y = 2x + 3e^{1-x} - 2$.

3. Tabulate the numerical solution of

$$\frac{dy}{dx} = \sin x + \cos y$$

from $x_0 = 30°$, $y_0 = 45°$ to $x = 60°$.

4 . Use the Taylor-series method to start the numerical solution of 


dx 


= *• + 9 *, 


with Xo = 1, yo = 0. 

5. Oiven the equation 





with «• = !, jro=l/4; find and then find five consecutive 

starting values by means of the Euler method and the Milne formulas. 

Note that a solution exists only in the regions where | x/2y \ > 1. 

6. Integrate the equation of Ex. 2 by Picard's method of successive
approximations.

II. EQUATIONS OF THE SECOND ORDER AND SYSTEMS OF
SIMULTANEOUS EQUATIONS

111. Equations of the Second Order. Any differential equation of the
second or higher order can be reduced to a system of first-order equations
by the introduction of auxiliary variables. Thus, the second-order equation

$$\frac{d^2y}{dx^2} + P\frac{dy}{dx} + Qy = f(x)$$

can be reduced to two first-order equations by putting y' = dy/dx. The
resulting equations are

$$\frac{dy}{dx} = y', \qquad \frac{dy'}{dx} = f(x) - Py' - Qy.$$

In like manner, any equation higher than the second order or any system
of equations of the second or higher order can be reduced to a system of
equations of the first order. These first-order equations can then be solved
by the methods already given, or soon to be given.
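The reduction can be expressed compactly in code. The sketch below builds the right-hand side of the equivalent first-order system for y'' + P y' + Q y = f(x); treating P and Q as constants is an assumption of the sketch, and the name `as_first_order_system` is hypothetical.

```python
def as_first_order_system(P, Q, f):
    """Return F(x, (y, yp)) -> (dy/dx, dyp/dx) for y'' + P*y' + Q*y = f(x).

    P and Q are taken as constants here (an assumption of this sketch).
    """
    def F(x, state):
        y, yp = state
        return (yp, f(x) - P * yp - Q * y)
    return F


# Illustrative coefficients only: y'' + 3y' + 2y = 0.
F = as_first_order_system(P=3.0, Q=2.0, f=lambda x: 0.0)
print(F(0.0, (1.0, 0.0)))
```

Any of the first-order methods of the preceding articles can then be applied to the pair (y, y') component by component.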

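Second-order differential equations can also be integrated numerically
by the following formulas, adapted from (108.9) and (108.10):

$$\bar{y}'_{n+1} = y'_n + h\left(y''_n + \tfrac{1}{2}\Delta_1 y''_n + \tfrac{5}{12}\Delta_2 y''_n + \tfrac{3}{8}\Delta_3 y''_n + \tfrac{251}{720}\Delta_4 y''_n\right) \tag{1}$$

for getting the trial value of $y'_{n+1}$;

$$\bar{y}_{n+1} = y_n + h\left(\bar{y}'_{n+1} - \tfrac{1}{2}\Delta_1 \bar{y}'_{n+1} - \tfrac{1}{12}\Delta_2 \bar{y}'_{n+1} - \tfrac{1}{24}\Delta_3 \bar{y}'_{n+1} - \tfrac{19}{720}\Delta_4 \bar{y}'_{n+1}\right) \tag{2}$$

for getting a first approximation to the new $y_{n+1}$;

$$\bar{y}''_{n+1} = f(x_{n+1}) - P\bar{y}'_{n+1} - Q\bar{y}_{n+1}, \tag{3}$$

the given equation, for getting the first approximation of $y''_{n+1}$;

$$y'_{n+1} = y'_n + h\left(\bar{y}''_{n+1} - \tfrac{1}{2}\Delta_1 \bar{y}''_{n+1} - \tfrac{1}{12}\Delta_2 \bar{y}''_{n+1} - \tfrac{1}{24}\Delta_3 \bar{y}''_{n+1} - \tfrac{19}{720}\Delta_4 \bar{y}''_{n+1}\right) \tag{4}$$

for checking and correcting the trial $y'_{n+1}$ found from (1).

Similar integration formulas can be obtained from (108.11) and (108.12).
The overlined quantities $\bar{y}_{n+1}$, $\bar{y}'_{n+1}$, $\bar{y}''_{n+1}$ are so marked to indicate that
they are the first approximations to y, y', and y'' at the point $x_{n+1}$.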


Example. When a pendulum swings in a resisting medium, its equation
of motion is of the form

$$\frac{d^2\theta}{dt^2} + a\frac{d\theta}{dt} + b\sin\theta = 0,$$

where a and b are constants. Assuming a = 0.2, b = 10, start the solution
of the above equation, taking as initial conditions θ = 0.3 radian and
dθ/dt = 0 when t = 0.

Solution. The substitutions $d\theta/dt = \dot\theta$, $d^2\theta/dt^2 = d\dot\theta/dt = \ddot\theta$ reduce the
given equation to the two first-order equations

$$\frac{d\theta}{dt} = \dot\theta, \qquad \frac{d\dot\theta}{dt} = -0.2\,\dot\theta - 10\sin\theta.$$

Since the second equation involves θ directly, it is necessary to compute
this angle at every step throughout the computation. Also, since θ in this
problem is always expressed in radians, it is practically necessary to use a
table of sines in which the argument is given directly in radians.*

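After the starting values have been found, the solution of this example
will be continued by means of the following formulas, used in the order
written:

$$\Delta\dot\theta = \int_{t_n}^{t_{n+1}} \ddot\theta\,dt = \Delta t\left(\ddot\theta_n + \tfrac{1}{2}\Delta_1\ddot\theta_n + \tfrac{5}{12}\Delta_2\ddot\theta_n + \tfrac{3}{8}\Delta_3\ddot\theta_n + \tfrac{251}{720}\Delta_4\ddot\theta_n\right), \tag{1}$$

for starting a new line.

$$\Delta\theta = \int_{t_n}^{t_{n+1}} \dot\theta\,dt = \Delta t\left(\underline{\dot\theta}_{n+1} - \tfrac{1}{2}\Delta_1\underline{\dot\theta}_{n+1} - \tfrac{1}{12}\Delta_2\underline{\dot\theta}_{n+1} - \tfrac{1}{24}\Delta_3\underline{\dot\theta}_{n+1} - \tfrac{19}{720}\Delta_4\underline{\dot\theta}_{n+1}\right), \tag{2}$$

for finding the first approximation to θ at time $t_{n+1}$.

$$\underline{\ddot\theta}_{n+1} = -0.2\,\underline{\dot\theta}_{n+1} - 10\sin\underline{\theta}_{n+1}, \tag{3}$$

for finding the first approximation to $\ddot\theta$ at time $t_{n+1}$.

$$\Delta\dot\theta = \Delta t\left(\underline{\ddot\theta}_{n+1} - \tfrac{1}{2}\Delta_1\underline{\ddot\theta}_{n+1} - \tfrac{1}{12}\Delta_2\underline{\ddot\theta}_{n+1} - \tfrac{1}{24}\Delta_3\underline{\ddot\theta}_{n+1} - \tfrac{19}{720}\Delta_4\underline{\ddot\theta}_{n+1}\right), \tag{4}$$

for checking and correcting the value of $\Delta\dot\theta$ found by (1).

The $\theta$, $\dot\theta$, and $\ddot\theta$ for the instant $t_{n+1}$ are underlined to indicate that
they are the first approximations.

If the first cycle of calculations for a step interval does not give results
of the required accuracy, a new computation is made by applying (2),
(3), and (4) in the order here given.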


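The starting values for this problem can be found by any of the methods
mentioned in Art. 109. We shall find them by the Taylor-series method.
The Taylor series for θ is

$$\theta = \theta_0 + \dot\theta_0 t + \frac{\ddot\theta_0 t^2}{2} + \frac{\theta'''_0 t^3}{3!} + \frac{\theta^{iv}_0 t^4}{4!} + \frac{\theta^{v}_0 t^5}{5!} + \frac{\theta^{vi}_0 t^6}{6!} + \cdots \tag{6}$$

From the given equation $\ddot\theta = -0.2\,\dot\theta - 10\sin\theta$ we get

$$\theta''' = -0.2\,\ddot\theta - 10\,\dot\theta\cos\theta$$
$$\theta^{iv} = -0.2\,\theta''' + 10\left(\dot\theta^2\sin\theta - \ddot\theta\cos\theta\right)$$
$$\theta^{v} = -0.2\,\theta^{iv} + 10\left[(\dot\theta^3 - \theta''')\cos\theta + 3\dot\theta\ddot\theta\sin\theta\right]$$
$$\theta^{vi} = -0.2\,\theta^{v} + 10\left[(3\dot\theta^2\ddot\theta - \theta^{iv})\cos\theta - (\dot\theta^4 - \dot\theta\,\theta''')\sin\theta + 3\left(\dot\theta^2\ddot\theta\cos\theta + \ddot\theta^2\sin\theta + \dot\theta\,\theta'''\sin\theta\right)\right].$$

For θ = 0.3 radian, $\dot\theta = 0$ when t = 0, the above equations give

$$\ddot\theta_0 = -10\sin 0.3 = -2.9552,$$
$$\theta'''_0 = -0.2\,\ddot\theta_0 = 0.2 \times 2.9552 = 0.59104,$$
$$\theta^{iv}_0 = -0.2\,\theta'''_0 - 10\,\ddot\theta_0\cos 0.3 = -0.1182 + 28.2321 = 28.1139,$$
$$\theta^{v}_0 = -0.2\,\theta^{iv}_0 - 10\,\theta'''_0\cos 0.3 = -5.6228 - 5.6464 = -11.2692,$$
$$\theta^{vi}_0 = -0.2\,\theta^{v}_0 - 10\,\theta^{iv}_0\cos 0.3 + 30\,\ddot\theta_0^{\,2}\sin 0.3 = 2.254 - 268.582 + 77.426 = -188.903.$$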



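Substituting in (6) these values and the initial conditions, we get

$$\theta = 0.3 - \frac{2.9552}{2}t^2 + \frac{0.59104}{6}t^3 + \frac{28.1139}{24}t^4 - \frac{11.2692}{120}t^5 - \frac{188.90}{720}t^6.$$

Differentiating (6) with respect to t, we have

$$\dot\theta = -2.9552\,t + \frac{0.59104}{2}t^2 + \frac{28.1139}{6}t^3 - \frac{11.2692}{24}t^4 - \frac{188.90}{120}t^5. \tag{7}$$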

We are now r^ady to find starting ralnes of fi, d, and 9 for values of t 

near aero and at steps of ^ sec. Putting ^ ~ ^ “ (®) 

and (7) and then computing the corresponding values of 9 from (1), we 
get the first five values of these quantities as given in the table on the 
following page. 

After forming the various orders of differences for these quantities,
we are ready to extend the table by applying formulas (1), (2), (3),
(4), etc. Two additional lines computed in this manner are given in the
table. The actual computation of the line for t = 0.15 was as follows:

Applying formula (1) to the acceleration quantities in the line for
t = 0.10, we have

$$\Delta\dot\theta = \frac{1}{20}\,[-2.758 + 0.066 + 0.029] = -0.1332.$$

This is the first entry in the new line for t = 0.15. Adding this $\Delta\dot\theta$ to
the previous value of $\dot\theta$, we get -0.4211 for the new $\dot\theta$. Now compute
the various orders of differences for the new $\dot\theta$.

We next compute Δθ for this line by applying (2) to the $\dot\theta$ quantities.
We thus have

$$\Delta\theta = \frac{1}{20}\,[-0.4211 + 0.0666 - 0.0007 - 0.0001] = -0.0178.$$

Adding this to the previous value of θ, we have

$$\theta_{0.15} = 0.2854 - 0.0178 = 0.2676.$$

We next substitute in the given equation (3) the values of θ and $\dot\theta$ for
the new line, in order to get the acceleration when t = 0.15. We thus
have

$$\ddot\theta = -0.2\,(-0.4211) - 10\sin 0.2676 = -2.560.$$

Then we compute the several orders of differences for this acceleration.


































The line for t = 0.15 is now completely filled, but it must be checked.
Hence we apply formula (4) to the acceleration data last filled in and have

$$\Delta\dot\theta = \frac{1}{20}\,[-2.560 - 0.099 - 0.005] = -0.1332.$$

Since this is the same value of $\Delta\dot\theta$ as previously found, there is no
possibility of improving any of the entries in the line for t = 0.15 and we
therefore regard them as correct.

Succeeding lines are added to the table in exactly the same manner as
above described.

In this example the time interval Δt must be taken short, because $\theta$, $\dot\theta$,
and $\ddot\theta$ are all changing rapidly. To obtain results accurate to four
significant figures Δt should be kept at 1/20 second.

Space docs not permit the higher differences of d to be shown in the 
table, but these differences should always be computed as a check on the 
accuracy of the computed values of d. 

Another check formula for second-order differential equations is (112.2)
of the next article.

Let us check the value of $\theta_{0.20}$ in the above table by formula (112.2). We
have

$$\theta_{0.20} = 2(0.2676) - 0.2854 + \frac{1}{400}\left[-2.560 + \frac{1}{12}(0.061)\right] = 0.2434,$$

which is the value already found and checked by (2).


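112. Second-Order Equations with First Derivative Absent. When
second-order differential equations do not contain first derivatives, their
numerical solutions can be found by a shorter method than that employed
in the preceding article. In this case we replace the second derivative by
a polynomial and integrate twice to find the desired formulas for numerical
integration.

We consider first the single equation

$$\frac{d^2y}{dx^2} = f(x, y), \tag{1}$$

which we write in the form

$$y'' = f(x, y). \tag{2}$$

To find a formula for integrating ahead by extrapolation we start with
Newton's formula II, in which we replace $y_n$ by $y''_n$. We thus have

$$y'' = y''_n + u\,\Delta_1 y''_n + \frac{u(u+1)}{2}\Delta_2 y''_n + \frac{u(u+1)(u+2)}{3!}\Delta_3 y''_n + \frac{u(u+1)(u+2)(u+3)}{4!}\Delta_4 y''_n + \cdots, \tag{3}$$

where $u = \dfrac{x - x_n}{h}$, or $x = x_n + hu$. Integrating (3) with respect to x and
remembering that dx = h du, we have

$$y' = h\left[u\,y''_n + \frac{u^2}{2}\Delta_1 y''_n + \frac{1}{2}\left(\frac{u^3}{3} + \frac{u^2}{2}\right)\Delta_2 y''_n + \frac{1}{6}\left(\frac{u^4}{4} + u^3 + u^2\right)\Delta_3 y''_n + \frac{1}{24}\left(\frac{u^5}{5} + \frac{3u^4}{2} + \frac{11u^3}{3} + 3u^2\right)\Delta_4 y''_n\right] + C_1. \tag{4}$$

We determine $C_1$ from the condition that $y' = y'_n$ when $x = x_n$ and
therefore u = 0. On putting u = 0 and $y' = y'_n$ in (4), we find $C_1 = y'_n$.

Now replacing $C_1$ by $y'_n$ in (4) and integrating again with respect
to x, we get

$$y = hu\,y'_n + h^2\left[\frac{u^2}{2}y''_n + \frac{u^3}{6}\Delta_1 y''_n + \frac{1}{2}\left(\frac{u^4}{12} + \frac{u^3}{6}\right)\Delta_2 y''_n + \frac{1}{6}\left(\frac{u^5}{20} + \frac{u^4}{4} + \frac{u^3}{3}\right)\Delta_3 y''_n + \frac{1}{24}\left(\frac{u^6}{30} + \frac{3u^5}{10} + \frac{11u^4}{12} + u^3\right)\Delta_4 y''_n\right] + C_2.$$

We find $C_2$ by putting $y = y_n$ when u = 0 and thus find $C_2 = y_n$. Then
we have

$$y = y_n + hu\,y'_n + h^2\left[\frac{u^2}{2}y''_n + \frac{u^3}{6}\Delta_1 y''_n + \frac{1}{2}\left(\frac{u^4}{12} + \frac{u^3}{6}\right)\Delta_2 y''_n + \frac{1}{6}\left(\frac{u^5}{20} + \frac{u^4}{4} + \frac{u^3}{3}\right)\Delta_3 y''_n + \frac{1}{24}\left(\frac{u^6}{30} + \frac{3u^5}{10} + \frac{11u^4}{12} + u^3\right)\Delta_4 y''_n\right]. \tag{5}$$

The values of y for $x = x_{n+1}$ and $x = x_{n-1}$ are found by putting u = 1
and u = -1 in (5). We thus obtain

$$y_{n+1} = y_n + hy'_n + h^2\left[\frac{1}{2}y''_n + \frac{1}{6}\Delta_1 y''_n + \frac{1}{8}\Delta_2 y''_n + \frac{19}{180}\Delta_3 y''_n + \frac{3}{32}\Delta_4 y''_n\right] \tag{6}$$

$$y_{n-1} = y_n - hy'_n + h^2\left[\frac{1}{2}y''_n - \frac{1}{6}\Delta_1 y''_n - \frac{1}{24}\Delta_2 y''_n - \frac{1}{45}\Delta_3 y''_n - \frac{7}{480}\Delta_4 y''_n\right] \tag{7}$$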


844 SOLUTION OF ORDINARY DIFFERENTIAL EQUATIONS [Chap. XIIL 
Adding (6) and (7), we get 

Sf»*i + |f»-i = -f- 12 Aiy"* + Asy " n + 

19 1 1 

Now replacing ^ by jg we finally obtain 

(8) = *».-»„ + A-f/”, + i (4.jr. + Arf". + Arf”.) 

~840^’l’ 

which is Stormer’s extrapolation formula for integrating ahead.* 

To derive a formula for checking and correcting we start with 
Stirling’s interpolation formula with y replaced by namely, 


(9) y" = A + « 


A/Vi + A/^, 


+^Ayv.+ 


«(«• — !) 




, X x% 

where u = — ^ — 


and therefore ix = hdu. 


Integrating (9) twice with respect to x and determining the constants 
of integration from the conditions y' = /n and y = y« when u = 0, we get 


( 10 ) 

• •• 

Now putting 11 = 1 and tt = — 1 successively in (10) aud adding the 
resulting equations, we obtain 

yiKi = 2y*— yi .^1 + ^ ^y Vi — ^ Ay Vi). 

On changing to horizontal differences by means of the relation 6!^y% 
= we finally get 

(11) y**! = 2y» — y*_i + k*(y^^* + Yg AyVi 2 ^ Aoy'Vt). 



Art. 112] EQUATIONS WITH FIRST DERIVATIVE ABSENT 846 

In the applications of (8) and (11) the terms $\tfrac1{240}\Delta_4 y''_n$ and $\tfrac1{240}\Delta_4 y''_{n+2}$ are neglected. Hence the working formulas are

$$y_{n+1} = 2y_n - y_{n-1} + h^2\Big[y''_n + \frac1{12}\big(\Delta_2 y''_n + \Delta_3 y''_n + \Delta_4 y''_n\big)\Big] \tag{112.1}$$

for finding the approximate value of $y_{n+1}$, and

$$y_{n+1} = 2y_n - y_{n-1} + h^2\Big(y''_n + \frac1{12}\Delta_2 y''_{n+1}\Big) \tag{112.2}$$

for checking and correcting the $y_{n+1}$ found from (112.1).

Formulas (112.1) and (112.2) give a step-by-step solution of the equation $\dfrac{d^2y}{dx^2} = f(x, y)$ with given initial conditions. The first is a formula for integrating ahead and finding the approximate value of $y_{n+1}$ by extrapolation. The extrapolated value is checked and corrected by (112.2). The starting values of $y$ and $y''$ are to be found by the methods given in Art. 109.

Example. Tabulate the solution of

$$\frac{d^2y}{dx^2} - \sin y + 1 = 0, \quad\text{or}\quad y'' = \sin y - 1,$$

with the initial conditions $y = 0.1132$ and $\dfrac{dy}{dx} = 0$ when $x = 0$.

Solution. We find the first few values of $y$ from its Taylor expansion about the point $x = 0$. Starting with the given equation $y'' = \sin y - 1$, we have

$$y''' = \cos y\,\frac{dy}{dx} = y'\cos y,$$
$$y^{iv} = -y'^2\sin y + y''\cos y,$$
$$y^{v} = y'''\cos y - 3y'y''\sin y - y'^3\cos y.$$

Substituting in the above equations the initial values $y = 0.1132$, $y' = 0$ when $x = 0$, we get

$$y''_0 = \sin(0.1132) - 1 = 0.112958 - 1 = -0.88704,$$
$$y'''_0 = 0,$$
$$y^{iv}_0 = -0.88704\cos(0.1132) = -0.88704 \times 0.99360 = -0.88136,$$
$$y^{v}_0 = 0.$$



Substituting these values in the Taylor formula

$$y = y_0 + y'_0 x + \frac{y''_0}{2!}x^2 + \frac{y'''_0}{3!}x^3 + \frac{y^{iv}_0}{4!}x^4 + \cdots,$$

we get

$$y = 0.1132 - 0.44351x^2 - 0.03672x^4. \tag{a}$$



By means of this equation (a) we compute the $y$'s in the first five rows of the following table. Then we substitute these $y$'s in the given equation to find the corresponding values of $y''$ in the seventh column of the table. The differences $\Delta_k y''$ are then computed.

From this point onward we continue the computation with $h = \tfrac1{20} = 0.05$ by means of formulas (112.1) and (112.2), always applying (112.1) first and then checking and correcting the new row by (112.2). Hence to get started on the sixth row we apply (112.1) to the last row found by Taylor's series and get

$$y_{n+1} = 2(0.1088) - 0.1121 + \frac1{400}\Big[-0.8914 + \frac1{12}(-0.0022)\Big] = 0.1033.$$

Then we substitute this value of $y$ in the given equation and find $y'' = -0.8969$. The sixth row is completed by filling in the new differences.

We now apply (112.2) to the new sixth row as a check on its correctness. We have

$$y_{n+1} = 2(0.1088) - 0.1121 + \frac1{400}\Big[-0.8914 - \frac1{12}(0.0022)\Big] = 0.1033$$

as before. Hence we consider the sixth line to be correct. The succeeding lines are computed in the same way.

The differences of the y’s are not used in the computation, but they 
should be computed as a check on the accuracy of the work. Irregularities 
in the higher differences would indicate that a mistake had been made in 
the computation. 
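The sixth-row computation above can be retraced mechanically. The following is a minimal sketch, not the author's computing schedule: it assumes the five starting rows lie at x = -0.10, -0.05, 0, 0.05, 0.10 and are rounded to four decimals as in the worked arithmetic, and the names `last_differences` and `stormer_step` are illustrative only.

```python
import math

def f(x, y):
    """Right-hand member of the example equation y'' = sin y - 1 (x is absent)."""
    return math.sin(y) - 1.0

def last_differences(values, max_order=4):
    """Backward differences of orders 1..max_order taken at the last entry."""
    diffs, row = [], list(values)
    for _ in range(max_order):
        row = [b - a for a, b in zip(row, row[1:])]
        diffs.append(row[-1])
    return diffs

def stormer_step(y, ypp, h, x_next):
    """Extrapolate y_{n+1} by (112.1), then check it once by (112.2)."""
    _, d2, d3, d4 = last_differences(ypp[-5:])
    base = 2.0 * y[-1] - y[-2]
    y_pred = base + h * h * (ypp[-1] + (d2 + d3 + d4) / 12.0)
    ypp_new = f(x_next, y_pred)
    d2_new = ypp_new - 2.0 * ypp[-1] + ypp[-2]   # second difference at the new row
    y_corr = base + h * h * (ypp[-1] + d2_new / 12.0)
    return y_pred, y_corr

h = 0.05
# Assumed layout of the starting rows (hypothetical; the table itself is not reproduced here).
xs = [-0.10, -0.05, 0.0, 0.05, 0.10]
ys = [round(0.1132 - 0.44351 * x**2 - 0.03672 * x**4, 4) for x in xs]   # equation (a)
ypps = [round(f(x, y), 4) for x, y in zip(xs, ys)]
y_pred, y_corr = stormer_step(ys, ypps, h, xs[-1] + h)
print(f"extrapolated {y_pred:.4f}, checked {y_corr:.4f}")
```

Both values should agree with the 0.1033 obtained by hand; a disagreement between them in a later row would call for repeating the check with the corrected value.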

Systems of two, three, or any number of simultaneous equations of the 
second order in which first derivatives are absent can be solved numerically 
by formulas similar to (112. 1) and (112. 2). Thus for a system of three 
equations 








$$\frac{d^2x}{dt^2} = f_1(x, y, z, t), \quad \frac{d^2y}{dt^2} = f_2(x, y, z, t), \quad \frac{d^2z}{dt^2} = f_3(x, y, z, t), \tag{112.3}$$

each equation is integrated separately by means of its own formula analogous to (112.1). Then the extrapolated values of $x$, $y$, $z$ are substituted into the right-hand members of (112.3) to get $x''$, $y''$, $z''$ at the new point ahead. Then new differences are computed, and formulas analogous to (112.2) are applied to the new rows as checks.

The necessary formulas for the three equations (112.3), for example, are

$$\begin{aligned}
x_{n+1} &= 2x_n - x_{n-1} + h^2\big[x''_n + \tfrac1{12}(\Delta_2 x''_n + \Delta_3 x''_n + \Delta_4 x''_n)\big],\\
y_{n+1} &= 2y_n - y_{n-1} + h^2\big[y''_n + \tfrac1{12}(\Delta_2 y''_n + \Delta_3 y''_n + \Delta_4 y''_n)\big],\\
z_{n+1} &= 2z_n - z_{n-1} + h^2\big[z''_n + \tfrac1{12}(\Delta_2 z''_n + \Delta_3 z''_n + \Delta_4 z''_n)\big],
\end{aligned} \tag{112.4}$$

for finding approximate values at the next point ahead, and

$$\begin{aligned}
x_{n+1} &= 2x_n - x_{n-1} + h^2\big(x''_n + \tfrac1{12}\Delta_2 x''_{n+1}\big),\\
y_{n+1} &= 2y_n - y_{n-1} + h^2\big(y''_n + \tfrac1{12}\Delta_2 y''_{n+1}\big),\\
z_{n+1} &= 2z_n - z_{n-1} + h^2\big(z''_n + \tfrac1{12}\Delta_2 z''_{n+1}\big),
\end{aligned} \tag{112.5}$$

for checking and correcting the new values given by (112.4). Here $h = t_{n+1} - t_n$.

Of course $t$ may be absent from the functions $f_1$, $f_2$, $f_3$ in the right-hand members of (112.3), just as $x$ was absent from the right-hand member of the equation $y'' = \sin y - 1$.

If the functions $f_1$, $f_2$, $f_3$ are easy to differentiate, the first five values of $x$, $y$, $z$ needed to start the computation can be found from the initial conditions and from the Taylor expansions of $x$, $y$, $z$, each as a function of $t$; if the three functions are not easy to differentiate, the beginning values must be found by the Milne method or by the modified Euler method, using short intervals of $t$.

113. Systems of Simultaneous Equations. We have already dealt with
certain systems of simultaneous equations in reducing a second-order 
equation to two first-order equations. In the present article we consider 
more general types of simultaneous equations. 

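Example. Required the numerical solution of the simultaneous equations

$$\frac{d^2x}{dt^2} - \frac{dy}{dt} + 2x - y = \ln\cos t, \qquad 2\frac{dy}{dt} - \frac{dx}{dt} - x = -\sin t, \tag{1}$$

with the initial conditions $x = 1$, $y = 2$, $\dfrac{dx}{dt} = 0$, when $t = 0$.

Solution. To integrate these equations numerically, we first write them in the forms

$$\dot y = \tfrac12(x + \dot x - \sin t), \tag{2}$$
$$\ddot x = y - 2x + \dot y + \ln\cos t. \tag{3}$$

To get the starting values we use the Taylor-series method. Assume

$$x = x_0 + \dot x_0 t + \frac{\ddot x_0}{2!}t^2 + \frac{\dddot x_0}{3!}t^3 + \frac{x^{iv}_0}{4!}t^4 + \cdots, \tag{4}$$
$$y = y_0 + \dot y_0 t + \frac{\ddot y_0}{2!}t^2 + \frac{\dddot y_0}{3!}t^3 + \frac{y^{iv}_0}{4!}t^4 + \cdots. \tag{5}$$

From (2) we get

$$\ddot y = \tfrac12(\dot x + \ddot x - \cos t), \quad \dddot y = \tfrac12(\ddot x + \dddot x + \sin t), \text{ etc.};$$

and similarly from (3),

$$\dddot x = \dot y - 2\dot x + \ddot y - \tan t, \quad x^{iv} = \ddot y - 2\ddot x + \dddot y - \sec^2 t, \text{ etc.}$$

Using the initial values and putting $t = 0$ in the above equations, we have

$$\begin{aligned}
\dot y_0 &= 1/2, & \ddot x_0 &= 2 - 2 + \tfrac12 = 1/2,\\
\ddot y_0 &= \tfrac12(\tfrac12 - 1) = -1/4, & \dddot x_0 &= 1/2 - 1/4 = 1/4,\\
\dddot y_0 &= \tfrac12(\tfrac12 + \tfrac14) = 3/8, & x^{iv}_0 &= -1/4 - 1 + 3/8 - 1 = -15/8,\\
y^{iv}_0 &= -5/16, & x^{v}_0 &= -7/16,\\
y^{v}_0 &= -37/32, & x^{vi}_0 &= 9/32,\\
y^{vi}_0 &= -37/64.
\end{aligned}$$

Note that these successive coefficients are found alternately by a zigzag procedure, starting with $\dot y_0$, then going to $\ddot x_0$, then back to $\ddot y_0$, etc.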


On substituting in (4) and (5) the coefficients just found, we get

$$x = 1 + \frac{t^2}{4} + \frac{t^3}{24} - \frac{5t^4}{64} - \frac{7t^5}{1920} + \frac{t^6}{2560}, \tag{6}$$

$$y = 2 + \frac{t}{2} - \frac{t^2}{8} + \frac{t^3}{16} - \frac{5t^4}{384} - \frac{37t^5}{3840} - \frac{37t^6}{46080}. \tag{7}$$

Also, from (6),

$$\dot x = \frac{t}{2} + \frac{t^2}{8} - \frac{5t^3}{16} - \frac{7t^4}{384} + \frac{3t^5}{1280}. \tag{8}$$

On putting $t = -0.10, -0.05, 0.05, 0.10$ in (6), (7), (8), and using the initial values, we get the values of $x$, $y$, and $\dot x$ given in the first five lines of the following table. The corresponding values of $\dot y$ and $\ddot x$ are found from the given equations (2) and (3), by using the proper values of $x$, $y$, $\dot x$, and $\dot y$.
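The starting rows can be checked by evaluating the truncated series directly. The sketch below is illustrative only: the coefficient lists are the Taylor coefficients of x and y through the sixth power of t as derived in this example, and the helper names (`evaluate`, `derivative`, `starting_row`) are hypothetical.

```python
import math
from fractions import Fraction as F

# Taylor coefficients of x(t) and y(t), constant term first.
X_SERIES = [F(1), F(0), F(1, 4), F(1, 24), F(-5, 64), F(-7, 1920), F(1, 2560)]
Y_SERIES = [F(2), F(1, 2), F(-1, 8), F(1, 16), F(-5, 384), F(-37, 3840), F(-37, 46080)]

def evaluate(coeffs, t):
    """Value of the polynomial with the given coefficients at t."""
    return sum(float(c) * t**k for k, c in enumerate(coeffs))

def derivative(coeffs):
    """Coefficients of the term-by-term derivative."""
    return [k * c for k, c in enumerate(coeffs)][1:]

def starting_row(t):
    """Return (x, y, dx/dt, dy/dt, d2x/dt2) for one starting line."""
    x = evaluate(X_SERIES, t)
    y = evaluate(Y_SERIES, t)
    x_dot = evaluate(derivative(X_SERIES), t)
    y_dot = 0.5 * (x + x_dot - math.sin(t))              # equation (2)
    x_ddot = y - 2.0 * x + y_dot + math.log(math.cos(t)) # equation (3)
    return x, y, x_dot, y_dot, x_ddot

for t in (-0.10, -0.05, 0.0, 0.05, 0.10):
    print(f"{t:+.2f}", " ".join(f"{v:8.4f}" for v in starting_row(t)))
```

Exact fractions are kept for the coefficients so that the only rounding is in the final evaluation; the printed rows may differ from a hand-computed table by a unit in the fourth decimal where a value falls near a rounding boundary.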

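The computation is continued by means of the following formulas:

$$\Delta\dot x = h\big[\ddot x + \tfrac12\Delta_1\ddot x + \tfrac5{12}\Delta_2\ddot x + \tfrac38\Delta_3\ddot x + \tfrac{251}{720}\Delta_4\ddot x\big], \tag{9}$$

for starting a new line;

$$\Delta x = h\big[\dot x - \tfrac12\Delta_1\dot x - \tfrac1{12}\Delta_2\dot x - \tfrac1{24}\Delta_3\dot x - \tfrac{19}{720}\Delta_4\dot x\big], \tag{10}$$

for finding $\Delta x$ in the new line;

$$\dot y = \tfrac12(x + \dot x - \sin t), \tag{11}$$

for finding $\dot y$ in the new line;

$$\ddot x = y - 2x + \dot y + \ln\cos t, \tag{12}$$

for finding $\ddot x$ in the new line;

$$\Delta\dot x = h\big[\ddot x - \tfrac12\Delta_1\ddot x - \tfrac1{12}\Delta_2\ddot x - \tfrac1{24}\Delta_3\ddot x - \tfrac{19}{720}\Delta_4\ddot x\big], \tag{13}$$

for checking $\Delta\dot x$ in the new line; all formulas to be used in the order given.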

114. Conditions for Convergence. The conditions for the convergence of the numerical solution by approximating polynomials can be arrived at most easily by means of the Picard process.

Table for the example of Art. 113 (p. 351); the difference columns, formed by successive subtraction, are not reproduced.

    t        x       dx/dt    d2x/dt2      y       dy/dt
  -0.10   1.0024   -0.0484    0.4657    1.9487    0.5269
  -0.05   1.0006   -0.0246    0.4852    1.9747    0.5130
   0      1.0000    0.0000    0.5000    2.0000    0.5000
  +0.05   1.0006    0.0253    0.5102    2.0247    0.4880
   0.10   1.0025    0.0509    0.5156    2.0488    0.4768
   0.15   1.0057    0.0767    0.5162    2.0724    0.4665
   0.20   1.0102    0.1024    0.5120    2.0955    0.4570

For the simple equation

$$\frac{dy}{dx} = f(x, y)$$

the Picard process converges to the true solution provided

$$\Delta x < \frac{1}{\left|\dfrac{\partial f}{\partial y}\right|_{\max}}$$

in the region of integration; and in the case of the two simultaneous equations

$$\frac{dx}{dt} = f_1(x, y, t), \qquad \frac{dy}{dt} = f_2(x, y, t),$$

the conditions for convergence are

$$\Delta t < \frac{1}{\left|\dfrac{\partial f_1}{\partial x}\right|_{\max} + \left|\dfrac{\partial f_2}{\partial x}\right|_{\max}} \quad\text{and}\quad \Delta t < \frac{1}{\left|\dfrac{\partial f_1}{\partial y}\right|_{\max} + \left|\dfrac{\partial f_2}{\partial y}\right|_{\max}}$$

in the region considered.*

Now since a polynomial can be made to approximate any continuous 
function to any required degree of accuracy, it follows that the approxi- 
mating polynomial used for the numerical solution of a differential equation 
can be made to approach the Picard solution by taking h sufficiently small. 
Then since the polynomial solution can be made to coincide with the Picard 
solution as closely as desired, it is evident from the geometric significance 
of partial derivatives that the conditions for the convergence of the poly- 
nomial solution are the same as for the Picard solution when h is suffi- 
ciently small. 

In the simple equation

$$y' = f(x, y)$$

the numerical process of solution will fail in a region where

$$\left|\frac{\partial f}{\partial y}\right| \to \infty,$$

and in the case of simultaneous equations the process will fail in a region where any one of the partial derivatives

$$\frac{\partial f_1}{\partial x}, \quad \frac{\partial f_2}{\partial x}, \quad \frac{\partial f_1}{\partial y}, \quad \frac{\partial f_2}{\partial y}$$

becomes infinite. Before starting the numerical solution of a differential equation, it is well to examine the partial derivatives for the range or region to be covered.

* The proof of these conditions will be found in the first edition of this book.

III. OTHER METHODS OF SOLVING DIFFERENTIAL EQUATIONS NUMERICALLY.

115. Milne's Method. A simple and reasonably accurate method of solving differential equations numerically has been devised by W. E. Milne.* It does not employ differences, but uses two quadrature formulas: one for integrating ahead by extrapolation and the other for checking the extrapolated value. These formulas are derived from Newton's formula (I), p. 68.

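That formula in terms of $y'$ and $u$ is

$$y' = y'_0 + u\,\Delta y'_0 + \frac{u(u-1)}{2}\Delta^2 y'_0 + \frac{u(u-1)(u-2)}{6}\Delta^3 y'_0 + \frac{u(u-1)(u-2)(u-3)}{24}\Delta^4 y'_0, \tag{1}$$

where $u = \dfrac{x - x_0}{h}$, or $x = x_0 + hu$. Integrating this formula over the interval $x_0$ to $x_0 + 4h$, or $u = 0$ to $u = 4$, we have, since $dx = h\,du$,

$$\Delta y = \int_{x_0}^{x_0+4h} y'\,dx = h\int_0^4\Big(y'_0 + u\,\Delta y'_0 + \frac{u(u-1)}{2}\Delta^2 y'_0 + \frac{u(u-1)(u-2)}{6}\Delta^3 y'_0 + \frac{u(u-1)(u-2)(u-3)}{24}\Delta^4 y'_0\Big)du$$

$$= h\Big(4y'_0 + 8\Delta y'_0 + \frac{20}{3}\Delta^2 y'_0 + \frac83\Delta^3 y'_0 + \frac{14}{45}\Delta^4 y'_0\Big).$$

Now replacing the first, second, and third differences by their values as given on p. 48 and simplifying, we get

$$\Delta y = \frac{4h}{3}(2y'_1 - y'_2 + 2y'_3) + \frac{14}{45}h\,\Delta^4 y'_0.$$

But here $\Delta y = y_4 - y_0$. Hence

$$y_4 = y_0 + \frac{4h}{3}(2y'_1 - y'_2 + 2y'_3) + \frac{14}{45}h\,\Delta^4 y'_0. \tag{2}$$

This is Milne's extrapolation formula.

To get the checking formula we integrate (1) from $x_0$ to $x_0 + 2h$, or from $u = 0$ to $u = 2$. Then

$$\Delta y = h\Big(2y'_0 + 2\Delta y'_0 + \frac13\Delta^2 y'_0 - \frac1{90}\Delta^4 y'_0\Big).$$

Now replacing $\Delta y'_0$ and $\Delta^2 y'_0$ by their values as given on p. 48, we have

$$\Delta y = \frac h3(y'_0 + 4y'_1 + y'_2) - \frac h{90}\Delta^4 y'_0.$$

But in this case $\Delta y = y_2 - y_0$. Hence

$$y_2 = y_0 + \frac h3(y'_0 + 4y'_1 + y'_2) - \frac h{90}\Delta^4 y'_0. \tag{3}$$

This is the second of the Milne formulas and is seen at once to be Simpson's Rule. The terms involving $\Delta^4 y'_0$ are not used directly in the application of (2) and (3), but only as indicators of the accuracy of the results.

Since $x_0, \ldots, x_4$ may be any five consecutive values of $x$, formulas (2) and (3) may be written in the more general forms

$$y_{n+1} = y_{n-3} + \frac{4h}{3}(2y'_{n-2} - y'_{n-1} + 2y'_n), \tag{4}$$

$$y_{n+1} = y_{n-1} + \frac h3(y'_{n-1} + 4y'_n + y'_{n+1}), \tag{5}$$

which are the final forms of the Milne formulas.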

The principal part of the error in the value of $y$ computed by these formulas is easily found as follows:

Let $y^{(1)}_{n+1}$ and $y^{(2)}_{n+1}$ denote the values of $y$ given by (4) and (5), respectively. Then if the value of $h$ is such that the inherent error in each formula is given by its remainder term involving $\Delta^4 y'$, the true value of $y$ at $x = x_{n+1}$ is either

$$y = y^{(1)}_{n+1} + \frac{14}{45}h\,\Delta^4 y'$$

or

$$y = y^{(2)}_{n+1} - \frac1{90}h\,\Delta^4 y'.$$

Equating these values of $y$, we have

$$y^{(1)}_{n+1} + \frac{28}{90}h\,\Delta^4 y' = y^{(2)}_{n+1} - \frac1{90}h\,\Delta^4 y',$$

or

$$y^{(1)}_{n+1} - y^{(2)}_{n+1} = -\frac{29}{90}h\,\Delta^4 y' = 29\Big(-\frac1{90}h\,\Delta^4 y'\Big) = 29E_2,$$

where $E_2$ denotes the principal part of the error in (5). From this we get

$$E_2 = \frac{y^{(1)}_{n+1} - y^{(2)}_{n+1}}{29}. \tag{6}$$

This simple formula enables the computer to test the accuracy of each computed result. If we write

$$D = y^{(1)}_{n+1} - y^{(2)}_{n+1}, \tag{7}$$

it is well to provide a column for $D$ just to the right of the column of $y$'s, or whatever quantity is being computed; and the behavior of the $D$'s should be observed as the computation proceeds. If the $D$'s become erratic, look at once for a mistake.

It will be observed that Milne's method requires the four starting values $y_{n-3}$, $y'_{n-2}$, $y'_{n-1}$, and $y'_n$. These values are to be found by the starting methods previously described in this book. Milne's method will now be applied to three types of differential equations.

(a) Equations of the First Order. To tabulate the solution of the first-order equation

$$\frac{dy}{dx} = y' = f(x, y), \tag{a}$$

we first find three consecutive values of $y$ and $y'$ in addition to the initial values. Then we find the next value of $y$ by (4), substitute this in (a) to get the new $y'$, and then substitute the new $y'$ in (5) to get the corrected value of the new $y$. If the corrected value agrees closely with the extrapolated value, proceed to the next interval.

If the corrected value differs appreciably from the extrapolated value and no error can be found in the work, compute $E_2$ by (6). If $E_2$ is too small to affect the last digit to be retained, all is well; proceed to the next interval. But if $E_2$ is large enough to affect the last figure to be retained, the value of $h$ is too large and must be reduced.



Example 1. Tabulate by Milne's method the numerical solution of

$$\frac{dy}{dx} = x + y$$

with $x_0 = 0$, $y_0 = 1$.

Solution. On page 326 we found the starting values given in the first four lines of the accompanying table. To start the fifth line we have by (4)

$$y_{0.4} = 1 + \frac{0.4}{3}\big[2(1.2103) - 1.4428 + 2(1.6997)\big] = 1.5836.$$

Then

$$y'_{0.4} = 0.4 + 1.5836 = 1.9836.$$

Now checking $y_{0.4}$ by (5), we have

$$y_{0.4} = 1.2428 + \frac{0.1}{3}\big[1.4428 + 4(1.6997) + 1.9836\big] = 1.5836,$$

which is the same value of $y_{0.4}$ as found by (4). Hence we consider it correct.

Proceeding now to the next line, we have by (4)

$$y_{0.5} = 1.1103 + \frac{0.4}{3}\big[2(1.4428) - 1.6997 + 2(1.9836)\big] = 1.7974.$$

Then

$$y'_{0.5} = 1.7974 + 0.5 = 2.2974.$$

Now checking by (5),

$$y_{0.5} = 1.3997 + \frac{0.1}{3}\big[1.6997 + 4(1.9836) + 2.2974\big] = 1.7974,$$

which is the same value of $y_{0.5}$ as previously found.
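The two lines just computed can be reproduced by a short routine. This is a sketch under stated assumptions rather than a prescribed schedule: the starting rows are the four-decimal values used in the example, the corrector (5) is applied once per line, and the function names are illustrative.

```python
def f(x, y):
    """Right-hand member of Example 1, dy/dx = x + y."""
    return x + y

def milne_step(xs, ys, yps, h):
    """One line of the table: extrapolate by (4), check by (5), estimate E_2 by (6)."""
    x_new = xs[-1] + h
    y_pred = ys[-4] + 4.0 * h / 3.0 * (2.0 * yps[-3] - yps[-2] + 2.0 * yps[-1])
    y_corr = ys[-2] + h / 3.0 * (yps[-2] + 4.0 * yps[-1] + f(x_new, y_pred))
    e2 = (y_pred - y_corr) / 29.0
    return x_new, y_pred, y_corr, e2

def continue_table(xs, ys, yps, h, steps):
    """Append `steps` new lines, keeping the corrected y in each line."""
    xs, ys, yps = list(xs), list(ys), list(yps)
    log = []
    for _ in range(steps):
        x_new, y_pred, y_corr, e2 = milne_step(xs, ys, yps, h)
        log.append((x_new, y_pred, y_corr, e2))
        xs.append(x_new)
        ys.append(y_corr)
        yps.append(f(x_new, y_corr))
    return log

# Starting values of the example, rounded to four decimals.
START_X = [0.0, 0.1, 0.2, 0.3]
START_Y = [1.0000, 1.1103, 1.2428, 1.3997]
START_YP = [1.0000, 1.2103, 1.4428, 1.6997]

log = continue_table(START_X, START_Y, START_YP, 0.1, 2)
for x_new, y_pred, y_corr, e2 in log:
    print(f"x={x_new:.1f}  extrapolated={y_pred:.4f}  corrected={y_corr:.4f}  E2={e2:.1e}")
```

The column of E2 values plays the part of the D column recommended above: a value large enough to touch the last retained digit signals that h should be reduced.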

Table for Example 1.

    x        y         y'
    0      1.0000    1.0000
    0.1    1.1103    1.2103
    0.2    1.2428    1.4428
    0.3    1.3997    1.6997
    0.4    1.5836    1.9836
    0.5    1.7974    2.2974

(b) Equations of the Second Order. Since formulas (4) and (5) are merely relations between a function and its derivative, similar formulas hold when the function is $y'$. Hence we may write

$$y'_{n+1} = y'_{n-3} + \frac{4h}{3}(2y''_{n-2} - y''_{n-1} + 2y''_n), \tag{8}$$

$$y'_{n+1} = y'_{n-1} + \frac h3(y''_{n-1} + 4y''_n + y''_{n+1}). \tag{9}$$

The general equation of the second order, when solved for $y''$, may be written in the symbolic form

$$y'' = f(x, y, y'). \tag{b}$$

When four starting values of $y$ and $y'$ have been found by some method, the solution is continued as follows:

1. Use (8) to find a first approximation to the new $y'$.

2. Substitute this new $y'$ in (5) to get a new $y$.

3. Substitute in the given equation (b) the new $y$ and new $y'$ to get an approximation to $y''$.

4. Check the new $y'$ by (9), using the new $y''$ just found.

5. If the $y'$ just found by (9) does not agree with that first found by (8), substitute the corrected $y'$ in (5) to get a corrected $y$.

6. Then substitute in (b) the corrected $y$ and $y'$ to find a corrected $y''$.

7. Substitute this corrected $y''$ in (9) to get a better $y'$, and then substitute this last $y'$ in (5) to get a better $y$.

8. As a final check, apply (6) to the last two consecutive $y'$'s and $y$'s. If the error is too great, decrease $h$.

Example. Compute by Milne's method the last line of the table on page 341.

Solution. Substituting in (8) the appropriate values of $\dot\theta$ and $\ddot\theta$, we have

$$\dot\theta_4 = 0 + \frac1{15}(-5.782 + 2.758 - 5.120) = -0.5429.$$

Now substituting this $\dot\theta_4$ in (5), we get

$$\theta_4 = 0.2854 + \frac{0.05}{3}(-0.2879 - 1.6844 - 0.5429) = 0.2435.$$

Then substituting $\theta_4$ and $\dot\theta_4$ in the given equation $\ddot\theta = -0.2\dot\theta - 10\sin\theta$, we get

$$\ddot\theta_4 = 0.1086 - 2.4110 = -2.3024.$$

As a check on $\dot\theta_4$ we next use (9) with the $\ddot\theta_4$ just computed. We have

$$\dot\theta_4 = -0.2879 + \frac{0.05}{3}(-2.758 - 10.240 - 2.302) = -0.5429,$$

which agrees with the value previously found. We therefore take these values to be correct.

The reader will note that formula (8) was used only once, and would not have been used again even if the two values of $\dot\theta$ had not agreed on the first round.

(c) Simultaneous Equations. The solution of simultaneous equations by Milne's method can be explained best by an example.

Example 3. Compute by Milne's method the values of $\dot x$, $x$, and $y$ in the seventh line of the table on p. 351.

Solution. We first find $\dot x_7$ by means of formula (8). Thus,

$$\dot x_7 = 0 + \frac{0.20}{3}\big[2(0.5102) - 0.5156 + 2(0.5162)\big] = 0.1025.$$

Now using this value of $\dot x$ in (5), we get

$$x_7 = 1.0025 + \frac{0.05}{3}\big[0.0509 + 4(0.0767) + 0.1025\big] = 1.0102.$$

Substituting these values of $x$ and $\dot x$ in the given equation $\dot y = \tfrac12(x + \dot x - \sin t)$, with $t = 0.2$, we have

$$\dot y_7 = \tfrac12(1.0102 + 0.1025 - 0.1987) = 0.4570.$$

Now using (5) to find $y$, we have

$$y_7 = 2.0488 + \frac{0.05}{3}\big[0.4768 + 4(0.4665) + 0.4570\big] = 2.0955.$$

We next substitute in the given equation $\ddot x = y - 2x + \dot y + \ln\cos t$ the values of $x$, $y$, and $\dot y$ found above, with $t = 0.2$. Then we get $\ddot x_7 = 0.5120$. Finally, we check the whole procedure by means of (9). We thus have

$$\dot x_7 = 0.0509 + \frac{0.05}{3}\big[0.5156 + 4(0.5162) + 0.5120\big] = 0.1024.$$

Since this value differs from the extrapolated value by only one unit in the last figure, we take it as correct.

Additional lines of the table can be computed by exactly the same procedure as employed above.

The reader will note that the values found by Milne's method are the same as those found by the difference method, formulas (108.1) and (108.2).

116. The Runge-Kutta Method. This method was devised by Runge* about the year 1894 and extended by Kutta† a few years later. It is unlike any of the methods explained in the preceding pages. Here the increments of the function (or functions) are calculated once for all by means of a definite set of formulas. The calculations for the first increment, for example, are exactly the same as for any other increment.

The formulas for several types of differential equations are given below.

† W. Kutta, Zeitschrift für Math. und Phys., Vol. 46 (1901).

(a) First-Order Equations. Let $dy/dx = f(x, y)$ represent any first-order equation, and let $h$ denote the interval between equidistant values of $x$. Then if the initial values are $x_0$, $y_0$, the first increment in $y$ is computed from the formulas

$$\begin{aligned}
k_1 &= f(x_0, y_0)\,h,\\
k_2 &= f\big(x_0 + \tfrac h2,\ y_0 + \tfrac{k_1}{2}\big)h,\\
k_3 &= f\big(x_0 + \tfrac h2,\ y_0 + \tfrac{k_2}{2}\big)h,\\
k_4 &= f(x_0 + h,\ y_0 + k_3)\,h,\\
\Delta y &= \tfrac16(k_1 + 2k_2 + 2k_3 + k_4),
\end{aligned}$$

taken in the order given. Then

$$x_1 = x_0 + h, \qquad y_1 = y_0 + \Delta y.$$

The increment in $y$ for the second interval is computed in a similar manner by means of the formulas

$$\begin{aligned}
k_1 &= f(x_1, y_1)\,h,\\
k_2 &= f\big(x_1 + \tfrac h2,\ y_1 + \tfrac{k_1}{2}\big)h,\\
k_3 &= f\big(x_1 + \tfrac h2,\ y_1 + \tfrac{k_2}{2}\big)h,\\
k_4 &= f(x_1 + h,\ y_1 + k_3)\,h,\\
\Delta y &= \tfrac16(k_1 + 2k_2 + 2k_3 + k_4),
\end{aligned}$$

and so on for the succeeding intervals.

It will be noticed that the only change in the formulas for the different intervals is in the values of $x$ and $y$ to be substituted. Thus, to find $\Delta y$ in the $n$th interval we should have to substitute $x_{n-1}$, $y_{n-1}$ in the expressions for $k_1$, $k_2$, etc.

In the special case where $dy/dx$ is a function of $x$ alone the Runge-Kutta method reduces to Simpson's Rule. For if $dy/dx = f(x)$, then

$$k_1 = f(x_0)\,h, \quad k_2 = f\big(x_0 + \tfrac h2\big)h, \quad k_3 = f\big(x_0 + \tfrac h2\big)h, \quad k_4 = f(x_0 + h)\,h,$$

and therefore

$$\Delta y = \frac h6\Big[f(x_0) + 2f\big(x_0 + \tfrac h2\big) + 2f\big(x_0 + \tfrac h2\big) + f(x_0 + h)\Big] = \frac{(h/2)}{3}\Big[f(x_0) + 4f\big(x_0 + \tfrac h2\big) + f(x_0 + h)\Big],$$

which is the same result as would be obtained by applying Simpson's Rule to the interval from $x_0$ to $x_0 + h$ if we take two equal subintervals of width $h/2$.

The derivation of the formulas used in the Runge-Kutta method is a somewhat lengthy process and will not be given here.*

The inherent error in the Runge-Kutta method is not easy to estimate, but is of the order $h^5$† and is therefore of the same order as that in Simpson's Rule.

* See Kutta, loc. cit., or Numerisches Rechnen, by C. Runge and H. König, pp. 287-294 and 311-313.

† See Kutta, loc. cit., or Numerische Integration, by F. A. Willers, pp. 91-92.

We shall illustrate the method by applying it to an example to which the previous methods were applied.

Example. Solve the equation

$$\frac{dy}{dx} = x + y,$$

with the initial conditions $x_0 = 0$, $y_0 = 1$.

Solution. Taking $h = 0.1$, we have

$$\begin{aligned}
k_1 &= 0.1 \times 1 = 0.1,\\
k_2 &= 0.1\,[0.05 + 1.05] = 0.11,\\
k_3 &= 0.1\,[0.05 + 1.055] = 0.1105,\\
k_4 &= 0.1\,[0.1 + 1.1105] = 0.12105,\\
\Delta y &= \tfrac16[0.1 + 0.22 + 0.221 + 0.12105] = 0.11034.
\end{aligned}$$

Hence $x_1 = x_0 + h = 0.1$, $y_1 = y_0 + \Delta y = 1 + 0.1103 = 1.1103$.

Then for the second interval we have

$$\begin{aligned}
k_1 &= 0.1\,(0.1 + 1.1103) = 0.12103,\\
k_2 &= 0.1\,(0.1 + 0.05 + 1.1103 + 0.06051) = 0.13208,\\
k_3 &= 0.1\,(0.1 + 0.05 + 1.1103 + 0.06604) = 0.13263,\\
k_4 &= 0.1\,(0.1 + 0.1 + 1.1103 + 0.13263) = 0.14429,\\
\Delta y &= \tfrac16(0.12103 + 0.26416 + 0.26526 + 0.14429) = 0.13246,
\end{aligned}$$

and $x_2 = 0.2$, $y_2 = 1.1103 + 0.1325 = 1.2428$. These values for $y_1$ and $y_2$ are correct to four decimal places. The computation can be continued in this manner as far as desired.
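Because every interval uses the same four formulas, the method is easily mechanized. The following is a minimal sketch of the first-order formulas applied to the example dy/dx = x + y; the function names `rk4_step` and `tabulate` are illustrative and not taken from the text.

```python
def rk4_step(f, x, y, h):
    """One interval of the first-order Runge-Kutta formulas; returns the new y."""
    k1 = h * f(x, y)
    k2 = h * f(x + h / 2.0, y + k1 / 2.0)
    k3 = h * f(x + h / 2.0, y + k2 / 2.0)
    k4 = h * f(x + h, y + k3)
    return y + (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0

def tabulate(f, x0, y0, h, steps):
    """Tabulate (x, y) from the initial point over the given number of intervals."""
    rows = [(x0, y0)]
    for i in range(steps):
        x, y = rows[-1]
        rows.append((x0 + (i + 1) * h, rk4_step(f, x, y, h)))
    return rows

rows = tabulate(lambda x, y: x + y, 0.0, 1.0, 0.1, 2)
for x, y in rows:
    print(f"{x:.1f}  {y:.4f}")
```

The sketch carries full machine precision from one interval to the next, whereas the hand computation rounds y to four decimals before starting the second interval; the two agree to the four places retained.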

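(b) Second-Order Equations of the general type

$$y'' = f(x, y, y')$$

are integrated step by step by means of the following formulas, applied in the order given:

$$\begin{aligned}
k_1 &= h f(x_n,\ y_n,\ y'_n),\\
k_2 &= h f\big(x_n + \tfrac h2,\ y_n + \tfrac h2 y'_n + \tfrac h8 k_1,\ y'_n + \tfrac{k_1}{2}\big),\\
k_3 &= h f\big(x_n + \tfrac h2,\ y_n + \tfrac h2 y'_n + \tfrac h8 k_1,\ y'_n + \tfrac{k_2}{2}\big),\\
k_4 &= h f\big(x_n + h,\ y_n + h y'_n + \tfrac h2 k_3,\ y'_n + k_3\big),\\
\Delta y &= h\big[y'_n + \tfrac16(k_1 + k_2 + k_3)\big],\\
\Delta y' &= \tfrac16(k_1 + 2k_2 + 2k_3 + k_4),
\end{aligned}$$

where $n = 0, 1, 2, \ldots$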
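The second-order formulas can be sketched in the same way. The code below assumes the classical Nyström arrangement of the arguments (the term (h/8)k1 in the second argument of both k2 and k3) and tries it on two equations whose exact solutions are known, y'' = -y and y'' = -y'; the test equations and function names are illustrative choices, not taken from the text.

```python
import math

def rk_second_order_step(f, x, y, yp, h):
    """One interval for y'' = f(x, y, y'); returns the new (y, y')."""
    k1 = h * f(x, y, yp)
    k2 = h * f(x + h / 2.0, y + h / 2.0 * yp + h / 8.0 * k1, yp + k1 / 2.0)
    k3 = h * f(x + h / 2.0, y + h / 2.0 * yp + h / 8.0 * k1, yp + k2 / 2.0)
    k4 = h * f(x + h, y + h * yp + h / 2.0 * k3, yp + k3)
    dy = h * (yp + (k1 + k2 + k3) / 6.0)
    dyp = (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return y + dy, yp + dyp

def integrate(f, x0, y0, yp0, h, steps):
    """Apply the step repeatedly and return the final (y, y')."""
    y, yp = y0, yp0
    for i in range(steps):
        y, yp = rk_second_order_step(f, x0 + i * h, y, yp, h)
    return y, yp

# y'' = -y with y(0) = 1, y'(0) = 0 has the solution y = cos x.
y_end, yp_end = integrate(lambda x, y, yp: -y, 0.0, 1.0, 0.0, 0.1, 10)
print(y_end - math.cos(1.0), yp_end + math.sin(1.0))
```

A comparison of this kind against a known solution is a convenient way of confirming that the arguments of k2, k3, and k4 have been entered correctly before the formulas are applied to an equation that cannot be integrated in finite form.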

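For the special second-order equation

$$y'' = f(x, y),$$

the increments in $y$ and $y'$ are found from the formulas:

$$\begin{aligned}
k_1 &= h f(x_n,\ y_n),\\
k_2 &= h f\big(x_n + \tfrac h2,\ y_n + \tfrac h2 y'_n + \tfrac h8 k_1\big),\\
k_3 &= h f\big(x_n + h,\ y_n + h y'_n + \tfrac h2 k_2\big),\\
\Delta y &= h\big[y'_n + \tfrac16(k_1 + 2k_2)\big],\\
\Delta y' &= \tfrac16(k_1 + 4k_2 + k_3).
\end{aligned}$$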

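(c) Simultaneous Equations of the First Order. In a pair of simple simultaneous equations of the type

$$\frac{dx}{dt} = f_1(t, x, y), \qquad \frac{dy}{dt} = f_2(t, x, y),$$

the increments in $x$ and $y$ for the first interval are found from the following formulas:

$$\begin{aligned}
k_1 &= f_1(t_0, x_0, y_0)\,\Delta t, & l_1 &= f_2(t_0, x_0, y_0)\,\Delta t,\\
k_2 &= f_1\big(t_0 + \tfrac{\Delta t}{2},\ x_0 + \tfrac{k_1}{2},\ y_0 + \tfrac{l_1}{2}\big)\Delta t, & l_2 &= f_2\big(t_0 + \tfrac{\Delta t}{2},\ x_0 + \tfrac{k_1}{2},\ y_0 + \tfrac{l_1}{2}\big)\Delta t,\\
k_3 &= f_1\big(t_0 + \tfrac{\Delta t}{2},\ x_0 + \tfrac{k_2}{2},\ y_0 + \tfrac{l_2}{2}\big)\Delta t, & l_3 &= f_2\big(t_0 + \tfrac{\Delta t}{2},\ x_0 + \tfrac{k_2}{2},\ y_0 + \tfrac{l_2}{2}\big)\Delta t,\\
k_4 &= f_1(t_0 + \Delta t,\ x_0 + k_3,\ y_0 + l_3)\,\Delta t, & l_4 &= f_2(t_0 + \Delta t,\ x_0 + k_3,\ y_0 + l_3)\,\Delta t,\\
\Delta x &= \tfrac16(k_1 + 2k_2 + 2k_3 + k_4), & \Delta y &= \tfrac16(l_1 + 2l_2 + 2l_3 + l_4).
\end{aligned} \tag{4}$$

The increments for the succeeding intervals are computed in exactly the same way except that $t_0$, $x_0$, $y_0$ are replaced by $t_1$, $x_1$, $y_1$, etc. as we proceed.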
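A sketch of formulas (4) for a pair of equations follows. The k's and l's must be advanced together, each stage using the increments of both unknowns from the stage before. The trial system dx/dt = y, dy/dt = -x is a hypothetical choice made only because its solution (x = cos t, y = -sin t) is known.

```python
import math

def rk4_pair_step(f1, f2, t, x, y, dt):
    """One interval for dx/dt = f1(t, x, y), dy/dt = f2(t, x, y)."""
    k1 = f1(t, x, y) * dt
    l1 = f2(t, x, y) * dt
    k2 = f1(t + dt / 2.0, x + k1 / 2.0, y + l1 / 2.0) * dt
    l2 = f2(t + dt / 2.0, x + k1 / 2.0, y + l1 / 2.0) * dt
    k3 = f1(t + dt / 2.0, x + k2 / 2.0, y + l2 / 2.0) * dt
    l3 = f2(t + dt / 2.0, x + k2 / 2.0, y + l2 / 2.0) * dt
    k4 = f1(t + dt, x + k3, y + l3) * dt
    l4 = f2(t + dt, x + k3, y + l3) * dt
    dx = (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    dy = (l1 + 2.0 * l2 + 2.0 * l3 + l4) / 6.0
    return x + dx, y + dy

def solve_pair(f1, f2, t0, x0, y0, dt, steps):
    """Advance the pair over the given number of intervals."""
    x, y = x0, y0
    for i in range(steps):
        x, y = rk4_pair_step(f1, f2, t0 + i * dt, x, y, dt)
    return x, y

x_end, y_end = solve_pair(lambda t, x, y: y, lambda t, x, y: -x, 0.0, 1.0, 0.0, 0.1, 10)
print(x_end - math.cos(1.0), y_end + math.sin(1.0))
```

Computing all four k's before any of the l's would be a mistake, since k2 already requires l1.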

The simultaneous equations

$$\frac{dy}{dx} = f_1(x, y, z), \qquad \frac{dz}{dx} = f_2(x, y, z)$$

are solved by formulas (4) by changing $t$ to $x$, $x$ to $y$, $y$ to $z$, and putting $\Delta t = h$.


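(d) Simultaneous Equations of the Second Order.* To solve the pair of simultaneous equations

$$\frac{d^2y}{dx^2} = f_1\Big(x, y, z, \frac{dy}{dx}, \frac{dz}{dx}\Big), \qquad \frac{d^2z}{dx^2} = f_2\Big(x, y, z, \frac{dy}{dx}, \frac{dz}{dx}\Big) \tag{5}$$

by the Runge-Kutta method, put

$$\frac{dy}{dx} = u, \qquad \frac{dz}{dx} = v.$$

Then the system (5) becomes

$$\frac{du}{dx} = f_1(x, y, z, u, v), \qquad \frac{dv}{dx} = f_2(x, y, z, u, v).$$

Let $h$ denote an arbitrary increment in $x$, and let $k$, $l$, $m$, $p$ denote the corresponding increments in $y$, $z$, $u$, and $v$, respectively. Then $k$, $l$, $m$, $p$ are found according to the following computation scheme:

$$\begin{aligned}
k_1 &= uh, & l_1 &= vh,\\
k_2 &= \big(u + \tfrac{m_1}{2}\big)h, & l_2 &= \big(v + \tfrac{p_1}{2}\big)h,\\
k_3 &= \big(u + \tfrac{m_2}{2}\big)h, & l_3 &= \big(v + \tfrac{p_2}{2}\big)h,\\
k_4 &= (u + m_3)h, & l_4 &= (v + p_3)h,\\
k &= \tfrac16(k_1 + 2k_2 + 2k_3 + k_4), & l &= \tfrac16(l_1 + 2l_2 + 2l_3 + l_4);
\end{aligned}$$

$$\begin{aligned}
m_1 &= f_1(x, y, z, u, v)\,h, & p_1 &= f_2(x, y, z, u, v)\,h,\\
m_2 &= f_1\big(x + \tfrac h2,\ y + \tfrac{k_1}{2},\ z + \tfrac{l_1}{2},\ u + \tfrac{m_1}{2},\ v + \tfrac{p_1}{2}\big)h, & p_2 &= f_2\big(x + \tfrac h2,\ y + \tfrac{k_1}{2},\ z + \tfrac{l_1}{2},\ u + \tfrac{m_1}{2},\ v + \tfrac{p_1}{2}\big)h,\\
m_3 &= f_1\big(x + \tfrac h2,\ y + \tfrac{k_2}{2},\ z + \tfrac{l_2}{2},\ u + \tfrac{m_2}{2},\ v + \tfrac{p_2}{2}\big)h, & p_3 &= f_2\big(x + \tfrac h2,\ y + \tfrac{k_2}{2},\ z + \tfrac{l_2}{2},\ u + \tfrac{m_2}{2},\ v + \tfrac{p_2}{2}\big)h,\\
m_4 &= f_1(x + h,\ y + k_3,\ z + l_3,\ u + m_3,\ v + p_3)\,h, & p_4 &= f_2(x + h,\ y + k_3,\ z + l_3,\ u + m_3,\ v + p_3)\,h,\\
m &= \tfrac16(m_1 + 2m_2 + 2m_3 + m_4), & p &= \tfrac16(p_1 + 2p_2 + 2p_3 + p_4).
\end{aligned}$$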

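To solve the system

$$\begin{aligned}
\ddot x &= f_1(x, y, z, \dot x, \dot y, \dot z, t),\\
\ddot y &= f_2(x, y, z, \dot x, \dot y, \dot z, t),\\
\ddot z &= f_3(x, y, z, \dot x, \dot y, \dot z, t),
\end{aligned} \tag{8}$$

put

$$\dot x = v_x, \qquad \dot y = v_y, \qquad \dot z = v_z.$$

Then (8) becomes

$$\begin{aligned}
\dot v_x &= f_1(x, y, z, v_x, v_y, v_z, t),\\
\dot v_y &= f_2(x, y, z, v_x, v_y, v_z, t),\\
\dot v_z &= f_3(x, y, z, v_x, v_y, v_z, t).
\end{aligned} \tag{9}$$

If $h$ denotes an arbitrary increment in $t$, and $k$, $l$, $n$, $p$, $q$, $s$ denote the corresponding increments in $x$, $y$, $z$, $v_x$, $v_y$, $v_z$, respectively, these increments are found according to the following scheme:

$$\begin{aligned}
k_1 &= v_x h, & l_1 &= v_y h, & n_1 &= v_z h,\\
k_2 &= \big(v_x + \tfrac{p_1}{2}\big)h, & l_2 &= \big(v_y + \tfrac{q_1}{2}\big)h, & n_2 &= \big(v_z + \tfrac{s_1}{2}\big)h,\\
k_3 &= \big(v_x + \tfrac{p_2}{2}\big)h, & l_3 &= \big(v_y + \tfrac{q_2}{2}\big)h, & n_3 &= \big(v_z + \tfrac{s_2}{2}\big)h,\\
k_4 &= (v_x + p_3)h, & l_4 &= (v_y + q_3)h, & n_4 &= (v_z + s_3)h;
\end{aligned}$$

$$\begin{aligned}
p_1 &= f_1(x, y, z, v_x, v_y, v_z, t)\,h,\\
p_2 &= f_1\big(x + \tfrac{k_1}{2},\ y + \tfrac{l_1}{2},\ z + \tfrac{n_1}{2},\ v_x + \tfrac{p_1}{2},\ v_y + \tfrac{q_1}{2},\ v_z + \tfrac{s_1}{2},\ t + \tfrac h2\big)h,\\
p_3 &= f_1\big(x + \tfrac{k_2}{2},\ y + \tfrac{l_2}{2},\ z + \tfrac{n_2}{2},\ v_x + \tfrac{p_2}{2},\ v_y + \tfrac{q_2}{2},\ v_z + \tfrac{s_2}{2},\ t + \tfrac h2\big)h,\\
p_4 &= f_1(x + k_3,\ y + l_3,\ z + n_3,\ v_x + p_3,\ v_y + q_3,\ v_z + s_3,\ t + h)\,h.
\end{aligned}$$

The increments $q_1, \ldots, q_4$ and $s_1, \ldots, s_4$ are given by the same four expressions with $f_1$ replaced by $f_2$ and $f_3$, respectively, the arguments being unchanged. Finally,

$$\begin{aligned}
k &= \tfrac16(k_1 + 2k_2 + 2k_3 + k_4), & p &= \tfrac16(p_1 + 2p_2 + 2p_3 + p_4),\\
l &= \tfrac16(l_1 + 2l_2 + 2l_3 + l_4), & q &= \tfrac16(q_1 + 2q_2 + 2q_3 + q_4),\\
n &= \tfrac16(n_1 + 2n_2 + 2n_3 + n_4), & s &= \tfrac16(s_1 + 2s_2 + 2s_3 + s_4).
\end{aligned}$$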

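The system

$$\begin{aligned}
\ddot r &= f_1(r, \varphi, \theta, \dot r, \dot\varphi, \dot\theta, t),\\
\ddot\varphi &= f_2(r, \varphi, \theta, \dot r, \dot\varphi, \dot\theta, t),\\
\ddot\theta &= f_3(r, \varphi, \theta, \dot r, \dot\varphi, \dot\theta, t),
\end{aligned} \tag{10}$$

is solved by putting

$$\dot r = \alpha, \qquad \dot\varphi = \beta, \qquad \dot\theta = \gamma.$$

Then (10) becomes

$$\begin{aligned}
\dot\alpha &= f_1(r, \varphi, \theta, \alpha, \beta, \gamma, t),\\
\dot\beta &= f_2(r, \varphi, \theta, \alpha, \beta, \gamma, t),\\
\dot\gamma &= f_3(r, \varphi, \theta, \alpha, \beta, \gamma, t).
\end{aligned} \tag{11}$$

Let $h$ denote the increment in $t$. Then if $k$, $l$, $m$ and $p$, $q$, $s$ denote the corresponding increments in $\alpha$, $\beta$, $\gamma$ and $r$, $\varphi$, $\theta$, respectively, these increments are computed according to the following scheme:

$$\begin{aligned}
k_1 &= f_1(r, \varphi, \theta, \alpha, \beta, \gamma, t)\,h, & p_1 &= \alpha h,\\
k_2 &= f_1\big(r + \tfrac{p_1}{2},\ \varphi + \tfrac{q_1}{2},\ \theta + \tfrac{s_1}{2},\ \alpha + \tfrac{k_1}{2},\ \beta + \tfrac{l_1}{2},\ \gamma + \tfrac{m_1}{2},\ t + \tfrac h2\big)h, & p_2 &= \big(\alpha + \tfrac{k_1}{2}\big)h,\\
k_3 &= f_1\big(r + \tfrac{p_2}{2},\ \varphi + \tfrac{q_2}{2},\ \theta + \tfrac{s_2}{2},\ \alpha + \tfrac{k_2}{2},\ \beta + \tfrac{l_2}{2},\ \gamma + \tfrac{m_2}{2},\ t + \tfrac h2\big)h, & p_3 &= \big(\alpha + \tfrac{k_2}{2}\big)h,\\
k_4 &= f_1(r + p_3,\ \varphi + q_3,\ \theta + s_3,\ \alpha + k_3,\ \beta + l_3,\ \gamma + m_3,\ t + h)\,h, & p_4 &= (\alpha + k_3)h.
\end{aligned}$$

The increments $l_1, \ldots, l_4$ and $m_1, \ldots, m_4$ are given by the same four expressions with $f_1$ replaced by $f_2$ and $f_3$, respectively; and

$$\begin{aligned}
q_1 &= \beta h, & q_2 &= \big(\beta + \tfrac{l_1}{2}\big)h, & q_3 &= \big(\beta + \tfrac{l_2}{2}\big)h, & q_4 &= (\beta + l_3)h,\\
s_1 &= \gamma h, & s_2 &= \big(\gamma + \tfrac{m_1}{2}\big)h, & s_3 &= \big(\gamma + \tfrac{m_2}{2}\big)h, & s_4 &= (\gamma + m_3)h.
\end{aligned}$$

Finally,

$$\begin{aligned}
k &= \tfrac16(k_1 + 2k_2 + 2k_3 + k_4), & p &= \tfrac16(p_1 + 2p_2 + 2p_3 + p_4),\\
l &= \tfrac16(l_1 + 2l_2 + 2l_3 + l_4), & q &= \tfrac16(q_1 + 2q_2 + 2q_3 + q_4),\\
m &= \tfrac16(m_1 + 2m_2 + 2m_3 + m_4), & s &= \tfrac16(s_1 + 2s_2 + 2s_3 + s_4).
\end{aligned}$$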

The accuracy of all results obtained by the Runge-Kutta method can be estimated by performing a new computation with intervals of width $2h$. Then by (8), p. 187, the error in the first computed result will be 1/15 the difference of the two computed results.
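The estimate by repetition with doubled interval can be tried on the example dy/dx = x + y, whose exact solution y = 2e^x - x - 1 allows the estimate to be compared with the actual error. This is a sketch only; the divisor 15 presumes an error proportional to the fourth power of h, and the function names are illustrative.

```python
import math

def rk4_step(f, x, y, h):
    """One interval of the first-order Runge-Kutta formulas."""
    k1 = h * f(x, y)
    k2 = h * f(x + h / 2.0, y + k1 / 2.0)
    k3 = h * f(x + h / 2.0, y + k2 / 2.0)
    k4 = h * f(x + h, y + k3)
    return y + (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0

def rk4_solve(f, x0, y0, h, steps):
    """Return y after the given number of intervals of width h."""
    y = y0
    for i in range(steps):
        y = rk4_step(f, x0 + i * h, y, h)
    return y

def f(x, y):
    return x + y

y_h = rk4_solve(f, 0.0, 1.0, 0.1, 2)     # two intervals of width h
y_2h = rk4_solve(f, 0.0, 1.0, 0.2, 1)    # one interval of width 2h
estimate = (y_h - y_2h) / 15.0           # estimated error of the result with width h
actual = (2.0 * math.exp(0.2) - 0.2 - 1.0) - y_h
print(f"estimated {estimate:.2e}, actual {actual:.2e}")
```

The estimate and the actual error should agree in sign and in order of magnitude; exact agreement is not to be expected, since terms of higher order in h are neglected.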

117. Checks, Errors, and Accuracy. Attention has already been called to the use of formulas (108.2), (108.3), (108.4), and (108.6) for checking the computed change in a function over a single interval. A formula has also been given for checking the results found by Milne's method. Simpson's Rule furnishes a convenient and reliable means of checking the summation of any function over an even number of intervals. For example, the decrease in the horizontal velocity of the bullet in the example of Art. 121, from $t = 2$ to $t = 26$, is

$$\Delta\dot x = \int_2^{26}\ddot x\,dt = \frac23\big[\ddot x_2 + 4(\ddot x_4 + \ddot x_8 + \ddot x_{12} + \ddot x_{16} + \ddot x_{20} + \ddot x_{24}) + 2(\ddot x_6 + \ddot x_{10} + \ddot x_{14} + \ddot x_{18} + \ddot x_{22}) + \ddot x_{26}\big],$$

or

$$\Delta\dot x = \frac23\big[-19.52 + 4(-16.26 - 11.88 - 9.43 - 8.27 - 7.91 - 7.88) + 2(-13.77 - 10.46 - 8.72 - 8.02 - 7.88) - 7.90\big] = -247.76.$$

Hence

$$\dot x_{26} = 567.32 - 247.76 = 319.56,$$

which differs from the value in the table by only two units in the last digit. The fifth figure in all these numbers is uncertain, probably worthless, but the two methods certainly check within a unit in the fourth figure. The values of $y$ and $\dot y$ may be checked in a similar manner.
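The Simpson check above is a plain weighted sum and can be verified as follows. The sketch assumes the thirteen accelerations are spaced two units of t apart, as the factor 2/3 implies, and lists them in order of t; the function name is illustrative.

```python
def simpson(values, h):
    """Simpson's Rule over equally spaced ordinates (an even number of intervals)."""
    if len(values) < 3 or len(values) % 2 == 0:
        raise ValueError("Simpson's Rule needs an odd number of ordinates, at least three")
    odd = sum(values[1:-1:2])    # ordinates weighted 4
    even = sum(values[2:-1:2])   # interior ordinates weighted 2
    return h / 3.0 * (values[0] + 4.0 * odd + 2.0 * even + values[-1])

# Accelerations at t = 2, 4, ..., 26 as quoted in the check above.
accel = [-19.52, -16.26, -13.77, -11.88, -10.46, -9.43, -8.72,
         -8.27, -8.02, -7.91, -7.88, -7.88, -7.90]

delta_v = simpson(accel, 2.0)
print(f"{delta_v:.2f}  {567.32 + delta_v:.2f}")
```

The same routine serves for checking any other column of a tabulated solution, provided the number of intervals summed is even.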

A single error in any one of the quantities $\dot x$, $y$, and $\dot y$ will persist throughout the computation in the column in which it occurs, but its effect will usually not increase as the computation continues. An error in the acceleration will likewise persist and will affect in some degree all the other computed quantities, but the effect may not be serious. An error in the differences of the acceleration and in the second, third, and fourth differences of the other functions will soon disappear, and its effect on the final results will usually be negligible. If several errors are made, they will probably neutralize one another to a considerable extent, but it is possible that they may accumulate sufficiently to affect seriously some of the later results.

As an example of the effect of a single error near the beginning of a computation, it may be stated that the example of Art. 111 was first computed throughout by starting with an error of two units in the last digit of $\theta$ for $t = 0.05$ sec. The maximum error in any subsequent value of $\theta$ up to $t = 2.1$ was five units in the last digit, whereas the greatest error in any later value of $\theta$ and $\dot\theta$ was only two units in the last figure.

An error of more than a unit in the last digit of a computed result can usually be detected by inspection of the second, third, and fourth differences of that result. If these higher differences run smoothly (that is, vary in a regular fashion without sudden changes in magnitude or sign), it is quite certain that no error has been made; but if the third and fourth differences become grossly irregular, the student had better stop and look for an error at once. The error may be located approximately by the method explained in Art. 17. The computer should watch the behavior of the higher differences as he goes along, so as to detect an error as soon as possible after it appears.

The safest plan to insure accuracy is to take h so small that fourth 
differences will be negligible to the number of figures desired in the final 
results. When fourth differences are negligible, the application of formula 
(108. 2) as many times as it will effect improvement will usually insure 
that the error is less than half a unit in the last figure retained. Since
these half-unit (or less) errors are as likely to be positive as negative, 
they are largely neutralized in the calculation process. Hence it is not 
worth while to consider them in estimating the accuracy of a final result. 

Whatever method is used in tabulating the numerical solution of a 
differential equation or a system of equations, the successive differences of 
all computed quantities should be computed and recorded. The behavior 
of the differences will show at a glance whether a mistake has been made 
in the computation or whether the value of h is too large. 

118. Some General Remarks. The methods given in the preceding pages 
are believed to be sufficient for the numerical solution of all ordinary 
differential equations having numerical coefficients and sufficient initial 
conditions. Equations higher than the second order have not been treated, 
but equations of the third and fourth orders can be handled by the methods 
given. All that has to be remembered is that formulas for integrating ahead and starting a new line are to be applied to the derivative of highest order in the equation (or equations) and the various differences of that highest-order derivative.

The most important matter in the numerical solution of a differential 
equation is getting correct starting values. These can be found by 
several methods, but in some of them there is no certain means of deter- 
mining the accuracy of the results found. When the starting values are 
found by the modified Euler method, the value of h should be so small 
that one or two repetitions of the averaging process will give the final 
result for that value of h. Likewise, when the starting values are found 
by Taylor’s series, h should be so small that only three or four terms of 
the series have any effect on the computed result. 

Beliable starting values can be found by the Runge-Kutta method in 
many cases, but usually at a greater expenditure of labor than by other 
methods. 

The Milne method of finding starting values is accurate and reasonably 
short. When the derivative of the given highest derivative can be found 
without difficulty, this is a good method for computing the starting values. 

After correct starting values have been found, there are two good methods 
for continuing the computation : the method employing differences, formulas 
(108. 1) and (108. 2) for example, and the method of Milne. Which of these 
is preferable probably depends on the taste and equipment of the computer. 
In the difference method much of the work can be done mentally and with 
very little effort. On the other hand, if the computer is equipped with a 
computing machine and is expert in using it, he may find the Milne 
method shorter and easier. 


The Runge-Kutta method is too laborious for tabulating many steps of 
a numerical solution unless a computing machine is used. 

The Picard method gives the solution theoretically when the derivative 
is any type of function, but the method is of limited practical value 
because of the difficulty frequently encountered in performing the required 
successive integrations. 

The reader has observed that the numerical solution of a differential 
equation by any method involves considerable labor. But the numerical 
methods also have certain redeeming features in their favor; for they pro- 
vide a means of obtaining solutions to problems which could not be solved 
otherwise, and they also give a complete record of the behavior of the 
functions within the regions considered. In some problems the exact 
analytical solution may involve more labor than the numerical method if 
certain information is desired. The following example will illustrate this 
point. 

Suppose the differential equation 

$$\frac{dy}{dx} = \frac{y - x}{y + x}$$

is given, with initial conditions Xo = 0, yo = and it is required to find 
several corresponding values of x and y. The given equation can be solved 
by putting y = vx, separating the variables, and integrating. The result, 
for the given initial conditions, is 

To find pairs of corresponding values of x and y from this equation we 
could substitute the desired values of x and then solve the resulting equation 
for y. But this resulting equation will always be a complicated transcen- 
dental equation which can be solved only by trial — by Newton’s method 
or otherwise. The labor of solving this equation for even a single value 
of y would probably be as great as that of computing several tabular values 
by numerical integration. The numerical method might therefore be the 
easier in this example. 

The numerical solution of a differential equation, however, will give no 
information concerning the function outside the range of computed values, 
whereas the exact analytical solution will enable us to predict the behavior 
of the function for any values whatever of the independent variable. For 
this reason the solutions of differential equations expressing natural phe- 
nomena should always be obtained in analytical form if possible. 

EXERCISES XIV 

1. Solve the simultaneous equations 

with the initial conditions z = 0, y = 0.5 when t = 0. Compute the first 
six lines of a tabular solution. 

2. Compute the first six lines of a tabular solution of 

d9 

with the initial conditions $\theta = 30^\circ$, $d\theta/dt = 0$, when $t = 0$. 

i In (i> + y>) + l.n-> ^ • 

3. Use the method of Art. 112 to solve the equation 

with the initial conditions $\theta = 5^\circ$, $d\theta/dt = 0$, when $t = 0$. Tabulate the first 
six lines of the solution. 

4. Use the method of Art. Ill to solve 

0.0002959 . „ fdr\* 

5p= ?— + 

with the initial conditions r = 1, ^ = 0, when t = 0. Compute the first 
six lines of a tabular solution. 

5. Compute the first ten lines of a tabular solution of the simultaneous 
equations 

$$\frac{d^2x}{dt^2} = -\frac{0.0002959\,x}{r^3}, \qquad \frac{d^2y}{dt^2} = -\frac{0.0002959\,y}{r^3},$$

with the initial conditions $x = 0.31$, $y = 0$, $dx/dt = 0$, $dy/dt = 0.034$, when 
$t = 0$. Here $r = \sqrt{x^2 + y^2}$. 

IV. THE DIFFERENTIAL EQUATIONS OF EXTERIOR BALLISTICS. 

119. The Simplest Case: Flat Earth with Constant Acceleration of 
Gravity. This book is not primarily concerned with the derivations of 
differential equations, but inasmuch as one of the main fields of application 
of numerical integration to the solution of differential equations is that 
of exterior ballistics (the science which deals with the motion of a 
projectile after it leaves the gun), it seems not amiss to sketch briefly the 
derivation of the fundamental differential equations of the motion of 
projectiles. The projectile will be considered as a material particle acted on 
by the force of gravity and by a tangential retarding force due to the 
resistance of the air. In the present article the acceleration of gravity will 
be assumed constant in magnitude and direction, which means that we are 
assuming a flat earth and that the projectile does not reach a great height. 
The air resistance is proportional to some (variable) power of the velocity, 
which power itself depends on the velocity. The equations will be derived 
first by taking $\theta$ as the independent variable and then by taking time ($t$) 
as the independent variable. 

Case I. Taking $\theta$ as the Independent Variable. Let a projectile of 
weight $W$ be fired with an initial velocity $v_0$ at an angle of elevation $\phi$; 
let $v$ denote the velocity of the projectile at any point in its path and let 
$\theta$ denote the inclination of the velocity vector at that point; and, finally, 
let $\rho$ denote the radius of curvature of the trajectory (path) at the point 
in question and let $kv^n$ denote the air resistance at that point. Then 
resolving forces along the tangent and the normal at P (see Fig. 17), 
we have by the fundamental law of dynamics: 


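$$-kv^n - W\sin\theta = \frac{W}{g}\frac{dv}{dt}, \tag{1}$$

$$-W\cos\theta = \frac{W}{g}\frac{v^2}{\rho}. \tag{2}$$

Then from (1), 

$$dt = -\frac{dv}{g\left(\dfrac{k}{W}v^n + \sin\theta\right)}; \tag{3}$$

and since 

$$\frac{d\theta}{dt} = \frac{d\theta}{ds}\frac{ds}{dt} = v\frac{d\theta}{ds} = \frac{v}{\rho},$$

we have from (2) 

$$-\cos\theta = \frac{v}{g}\frac{d\theta}{dt},$$

or 

$$dt = -\frac{v\,d\theta}{g\cos\theta}. \tag{4}$$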




Equating the values of $dt$ from (3) and (4) and simplifying slightly, 
we obtain 

$$\frac{dv}{d\theta} - v\tan\theta = c\,v^{n+1}\sec\theta, \tag{5}$$

where $c = k/W$. 

This equation (5) is frequently called the fundamental equation of 
exterior ballistics. When $n$ is an integer, the equation becomes the well-known 
Bernoulli type of linear differential equation and can be solved for 
$v$ as a function of $\theta$. 

The values of the exponent n for various velocities are as given below: 

       0 < v <  790 ft./sec.,   n = 2 
     790 < v <  970 ft./sec.,   n = 3 
     970 < v < 1230 ft./sec.,   n = 5 
    1230 < v < 1370 ft./sec.,   n = 3 
    1370 < v < 1800 ft./sec.,   n = 2 
    1800 < v < 2600 ft./sec.,   n = 1.7 
    2600 < v < 3600 ft./sec.,   n = 1.55. 


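For $n = 2$ and the initial conditions $\theta = \phi$, $v = v_0$, the solution of 
(5) is 

$$\frac{1}{v^2} = c\cos^2\theta\left[\sec\phi\tan\phi - \sec\theta\tan\theta + \ln\frac{\sec\phi + \tan\phi}{\sec\theta + \tan\theta}\right] + \frac{\cos^2\theta}{v_0^2\cos^2\phi}. \tag{119.1}$$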


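To find in terms of $\theta$ the rectangular coordinates of any point on the 
trajectory and the time of flight of the projectile, we have 

$$\frac{dx}{d\theta} = \frac{dx}{ds}\frac{ds}{d\theta} = \rho\cos\theta = -\frac{v^2}{g},$$

$$\frac{dy}{d\theta} = \frac{dy}{ds}\frac{ds}{d\theta} = \rho\sin\theta = \rho\cos\theta\cdot\frac{\sin\theta}{\cos\theta} = -\frac{v^2}{g}\tan\theta,$$

$$\frac{dt}{d\theta} = \frac{dt}{ds}\frac{ds}{d\theta} = \frac{1}{v}(\rho) = \frac{\rho}{v} = -\frac{v}{g}\sec\theta.$$

Integrating these three expressions from $\theta = \phi$ to $\theta = \theta$, we get 

$$\left.\begin{aligned}
x &= -\frac{1}{g}\int_{\phi}^{\theta} v^2\,d\theta\\
y &= -\frac{1}{g}\int_{\phi}^{\theta} v^2\tan\theta\,d\theta\\
t &= -\frac{1}{g}\int_{\phi}^{\theta} v\sec\theta\,d\theta.
\end{aligned}\right\} \tag{119.2}$$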


These integrals can be computed by Simpson’s Rule as soon as the values 
of the integrands are known for equidistant values of $\theta$. In the case $n = 2$ 
the values of $v$ can be found from (119.1). 
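The procedure just described, for the case n = 2, can be sketched as follows. The function names are hypothetical, and the logarithmic term of (119.1) is taken as ln[(sec φ + tan φ)/(sec θ + tan θ)], the form obtained by integrating the Bernoulli equation directly. The constant c = k/W is assumed to be the retardation coefficient divided by g, so that the retardation 0.00005v² of the illustrative example in Art. 121 corresponds to c = 0.00005/32.16.

```python
import math

def sec_tan_term(a):
    """sec(a)*tan(a) + ln(sec(a) + tan(a)), i.e. twice the integral of sec^3."""
    sec = 1.0 / math.cos(a)
    return sec * math.tan(a) + math.log(sec + math.tan(a))

def velocity(theta, phi, v0, c):
    """Velocity from (119.1); valid for the case n = 2 only."""
    cos2 = math.cos(theta) ** 2
    inv_v2 = (c * cos2 * (sec_tan_term(phi) - sec_tan_term(theta))
              + cos2 / (v0 ** 2 * math.cos(phi) ** 2))
    return 1.0 / math.sqrt(inv_v2)

def simpson(f, a, b, n=200):
    """Composite Simpson's Rule with n (even) intervals; b may be less than a."""
    if n % 2:
        raise ValueError("n must be even")
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

def position_and_time(theta, phi, v0, c, g=32.16):
    """x, y and t from (119.2), integrating from phi to theta."""
    def v(a):
        return velocity(a, phi, v0, c)
    x = -simpson(lambda a: v(a) ** 2, phi, theta) / g
    y = -simpson(lambda a: v(a) ** 2 * math.tan(a), phi, theta) / g
    t = -simpson(lambda a: v(a) / math.cos(a), phi, theta) / g
    return x, y, t

# Example: the bullet of Art. 121 at the instant its path is horizontal.
phi = math.radians(38.5)
print(position_and_time(0.0, phi, 780.0, 0.00005 / 32.16))
```

With c = 0 the formulas reduce to the parabolic motion of a projectile in a vacuum, which gives a convenient check on the signs in (119.2).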


Case II. Taking Time (t) as the Independent Variable. In most 
ballistic calculations it is better to take time as the independent variable. 
To find the differential equations of motion for this case, we resolve forces 
in the horizontal and vertical directions. Then (see Fig. 17) 

$$-kv^n\cos\theta = \frac{W}{g}\ddot{x},$$

$$-kv^n\sin\theta - W = \frac{W}{g}\ddot{y},$$

or 

$$\ddot{x} = -\frac{gk}{W}v^n\cos\theta = -R\cos\theta = -\frac{R}{v}\,v\cos\theta = -\frac{R}{v}\dot{x},$$

$$\ddot{y} = -\frac{gk}{W}v^n\sin\theta - g = -R\sin\theta - g = -\frac{R}{v}\,v\sin\theta - g = -\frac{R}{v}\dot{y} - g,$$

where $R = \dfrac{gk}{W}v^n$. Hence the fundamental differential equations in 
rectangular form are 

$$\ddot{x} = -\frac{R}{v}\dot{x}, \qquad \ddot{y} = -\frac{R}{v}\dot{y} - g. \tag{119.3}$$

These equations are connected by the velocity $v = \sqrt{\dot{x}^2 + \dot{y}^2}$ and must 
therefore be integrated simultaneously. 

120. The General Case, Allowing for Variation in Air Density with 
Altitude. The ballistic equations thus far given do not permit the decrease 
in air resistance and gravity with altitude to be taken into account and are 
therefore inadequate for modern gunnery. To allow for variation in air 
density with altitude it has been the practice in this country to write the 
fundamental differential equations in the forms: 


$$\ddot{x} = -E\dot{x}, \qquad \ddot{y} = -E\dot{y} - g, \tag{120.1}$$

where 

$$E = \frac{G(v)\,H(y)}{C}.$$

Here $G(v)$ is a function of the velocity alone, $H(y)$ is a function of the 
altitude alone, and $C$ is a constant whose value depends on the weight and 
shape of the projectile. The function $H(y)$ is 

$$H(y) = 10^{-0.000045y} = e^{-0.0001036y}$$

when the altitude $y$ is in meters. The function $G(v)$ is much more 
complicated. These functions $G(v)$ and $H(y)$ have been tabulated for a wide 
range of values of $v$ and $y$. 
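The two forms of the altitude function can be compared numerically. The short sketch below uses hypothetical function names and only the two constants printed above; it checks that the base-10 and base-e exponents describe the same function to within the rounding of the coefficient 0.0001036.

```python
import math

def H_base10(y_m):
    """Altitude function in the form 10**(-0.000045*y), y in meters."""
    return 10.0 ** (-0.000045 * y_m)

def H_exp(y_m):
    """Altitude function in the form exp(-0.0001036*y), y in meters."""
    return math.exp(-0.0001036 * y_m)

for y in (0.0, 1000.0, 5000.0, 10000.0):
    print(y, H_base10(y), H_exp(y))
```

The agreement follows from 0.000045 × ln 10 = 0.00010362, so the two expressions differ only in the fourth significant figure of the exponent.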


121. Methods of Finding the Starting Values. The use of Taylor’s 
series in starting the computation of a trajectory by numerical integration 
is out of the question, because of the difficulty in finding successive 
derivatives of the given differential equations. The Runge-Kutta method is 
likewise unsuitable for obtaining the necessary starting values. The Milne 
method can be used for the simple problems not requiring the H-function, 
but in the general case of projectiles fired with high velocities and reaching 
considerable heights, the starting values must be found by one of the 
following methods: 

(a). The Modified Euler Method. This method, as previously stated, 
will give starting values in any problem, provided short steps (intervals) 
are taken. In starting the computation of a trajectory, two consecutive 
values of the required functions, in addition to the initial values, are com- 
puted by taking very short intervals of time. From these three values 
first and second differences are formed. Then the computation is con- 
tinued by means of formulas (108. 1) and (108.2). After the first five 
values are found, they are checked by formulas (108. 3), (108. 4), and (108.5). 





(b). Barker's Hyperbolic-Arc Method. In recent years a new and 
entirely different method of starting the computation of trajectories has 
been devised by J. E. Barker.* Having in mind the fact that the path 
of a projectile fired through the air has a vertical asymptote at no great 
distance from the gun (usually at a distance of two to three times the 
horizontal range), Barker conceived the idea of replacing the trajectory 
near the origin by the arc of a hyperbola passing through the origin and 
having a vertical asymptote at a horizontal distance $c$ from the origin. 
The equation of such a hyperbola is of the form 

$$y = \frac{x(ax + b)}{c - x}, \tag{1}$$

where $a$, $b$, and $c$ are undetermined parameters. These parameters are 
to be determined by the condition that the hyperbola shall have third-order 
contact with the trajectory at the origin. This means that $dy/dx$, $d^2y/dx^2$, and 
$d^3y/dx^3$ for the hyperbola must equal the same three derivatives of the 
trajectory at the origin. The hyperbola will then approximate the trajectory 
extremely closely in the neighborhood of the origin and may therefore be 
used for computing starting values for the trajectory in that region. 
From (1) we get by differentiation 

$$\left.\begin{aligned}
\frac{dy}{dx} &= \frac{2acx + bc - ax^2}{(c - x)^2}\\
\frac{d^2y}{dx^2} &= \frac{2ac^2 + 2bc}{(c - x)^3}\\
\frac{d^3y}{dx^3} &= \frac{6c(ac + b)}{(c - x)^4}.
\end{aligned}\right\} \tag{2}$$


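* Mathematician at the U. S. Naval Proving Grounds, Dahlgren, Va. 

To find the corresponding derivatives of the trajectory, we have 

$$\frac{dy}{dx} = \tan\theta = \frac{\dot{y}}{\dot{x}},$$

$$\frac{d^2y}{dx^2} = \sec^2\theta\,\frac{d\theta}{dx} = \sec^2\theta\,\frac{d\theta/dt}{dx/dt} = \sec^2\theta\,\frac{\dot{\theta}}{\dot{x}} = \dot{\theta}\sec\theta\cdot\frac{1}{\dot{x}\cos\theta}.$$

But since $\dot{\theta} = \dfrac{d\theta}{dt} = \dfrac{ds/dt}{ds/d\theta} = \dfrac{v}{\rho}$ and $\rho = -\dfrac{v^2}{g\cos\theta}$ by Art. 119, we have 
$\dot{\theta} = -\dfrac{g\cos\theta}{v}$, from which $\dot{\theta}\sec\theta = -\dfrac{g}{v}$. Hence 

$$\frac{d^2y}{dx^2} = -\frac{g}{v}\cdot\frac{1}{\dot{x}\cos\theta} = -\frac{g}{\dot{x}\,v\cos\theta} = -\frac{g}{\dot{x}^2}.$$

Also, 

$$\frac{d^3y}{dx^3} = \frac{d}{dx}\left(-\frac{g}{\dot{x}^2}\right) = \frac{1}{\dot{x}}\frac{d}{dt}\left(-\frac{g}{\dot{x}^2}\right) = \frac{2g\ddot{x}}{\dot{x}^4} = \frac{2g(-E\dot{x})}{\dot{x}^4} = -\frac{2gE}{\dot{x}^3}.$$

Now equating these three derivatives to the corresponding derivatives 
in (2), putting $x = 0$, $\dot{x} = \dot{x}_0$, $\theta = \phi$, $E = E_0$, and then solving the 
resulting equations for $a$, $b$, and $c$, we get 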


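$$c = \frac{3\dot{x}_0}{2E_0}, \qquad b = \frac{3\dot{x}_0}{2E_0}\tan\phi,$$

and 

$$a = -\frac{3g}{4E_0\dot{x}_0} - \tan\phi.$$

On substituting these values in (1) we get $y$ in terms of $x$. 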

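But since the independent variable in the ballistic equations is $t$, we 
must have $x$ and $y$ in terms of $t$. To get the required relations we go 
back to the relations 

$$\frac{d^2y}{dx^2} = \frac{2c(ac + b)}{(c - x)^3} = -\frac{g}{\dot{x}^2}.$$

Solving for $\dot{x}$, separating the variables, and integrating, we have 

$$\sqrt{-2c(ac + b)}\int_0^x (c - x)^{-3/2}\,dx = \sqrt{g}\int_0^t dt.$$

From this we find 

$$x = c\left[1 - \left(\frac{2\sqrt{-2(ac + b)}}{\sqrt{g}\,t + 2\sqrt{-2(ac + b)}}\right)^2\right].$$

Reducing the right-hand member to its simplest form and then replacing 
$a$, $b$, and $c$ by their values as previously found, we get 

$$x = \frac{3\dot{x}_0}{2E_0}\left[1 - \frac{9}{(3 + E_0t)^2}\right]. \tag{3}$$

When this value of $x$ is substituted in (1), the value of $y$ becomes 

$$y = \frac{\left[(3 + E_0t)^2 - 9\right]\left\{12E_0\dot{y}_0 - g\left[(3 + E_0t)^2 - 9\right]\right\}}{8E_0^2(3 + E_0t)^2},$$

which is equivalent to 

$$y = \frac{9g + 6E_0\dot{y}_0}{4E_0^2} - \frac{27(3g + 4E_0\dot{y}_0)}{8E_0^2(3 + E_0t)^2} - \frac{g(3 + E_0t)^2}{8E_0^2}. \tag{4}$$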

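Differentiating (3) with respect to $t$ and simplifying, we get 

$$\dot{x} = \dot{x}_0\left(1 + \frac{E_0t}{3}\right)^{-3}, \tag{121.1}$$

which may be written as the binomial series 

$$\dot{x} = \dot{x}_0\left[1 - (E_0t) + \tfrac{2}{3}(E_0t)^2 - \tfrac{10}{27}(E_0t)^3 + \tfrac{5}{27}(E_0t)^4 - \cdots\right]. \tag{121.2}$$

On differentiating (4) with respect to $t$, we find 

$$\dot{y} = -\frac{3g}{4E_0}\left(1 + \frac{E_0t}{3}\right) + \frac{3g}{4E_0}\left(1 + \frac{E_0t}{3}\right)^{-3} + \dot{y}_0\left(1 + \frac{E_0t}{3}\right)^{-3}.$$

In view of (121.1) and the fact that $\dot{y}_0/\dot{x}_0 = \tan\phi$, the third term on the 
right is equal to $\dot{x}\tan\phi$. Then on replacing the middle term by its 
binomial expansion and reducing the right-hand member to its simplest 
form, we get 

$$\dot{y} = \dot{x}\tan\phi - gt\left[1 - \tfrac{1}{2}(E_0t) + \tfrac{5}{18}(E_0t)^2 - \tfrac{5}{36}(E_0t)^3 + \cdots\right]. \tag{121.3}$$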

From (121.1) or (121.2), $\dot{x}$ is readily found. Then this value of $\dot{x}$ is 
used in (121.3) to find $\dot{y}$. Formula (4) is not convenient for finding $y$. 
When the values of $\dot{y}_{-1}$, $\dot{y}_1$, and $\dot{y}_2$ have been computed by means of (121.3), 
the values of $y$ are more easily found from the quadrature formulas 

$$\left.\begin{aligned}
u_{-1} &= u_0 - \frac{\Delta t}{24}\left(9\dot{u}_{-1} + 19\dot{u}_0 - 5\dot{u}_1 + \dot{u}_2\right)\\
u_1 &= u_0 + \frac{\Delta t}{24}\left[-\dot{u}_{-1} + 13(\dot{u}_0 + \dot{u}_1) - \dot{u}_2\right]\\
u_2 &= u_0 + \frac{\Delta t}{3}\left(\dot{u}_0 + 4\dot{u}_1 + \dot{u}_2\right),
\end{aligned}\right\} \tag{121.4}$$

where $u$ stands for any variable, $\dot{u}$ its time derivative, and $\Delta t\,(= h)$ is the 
interval between equidistant values of $t$. Note that $t$ in (121.1), (121.2), 
and (121.3) may be either positive or negative, but $\Delta t$ in (121.4) is always 
positive. 
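The quadrature formulas (121.4) translate directly into code. The sketch below uses a hypothetical function name `start_quadrature`; the coefficients are those of (121.4), and the sample figures are the velocities and accelerations of the illustrative example worked later in this article.

```python
def start_quadrature(u0, du_m1, du_0, du_1, du_2, dt):
    """Apply formulas (121.4).

    u0 is the known value at t = 0; du_m1, du_0, du_1, du_2 are the time
    derivatives at t = -dt, 0, dt, 2*dt.  Returns (u_{-1}, u_1, u_2).
    """
    u_m1 = u0 - dt / 24.0 * (9.0 * du_m1 + 19.0 * du_0 - 5.0 * du_1 + du_2)
    u_1 = u0 + dt / 24.0 * (-du_m1 + 13.0 * (du_0 + du_1) - du_2)
    u_2 = u0 + dt / 3.0 * (du_0 + 4.0 * du_1 + du_2)
    return u_m1, u_1, u_2

# Heights y_1 and y_2 from the vertical velocities of the example (dt = 1/4 sec).
_, y_1, y_2 = start_quadrature(0.0, 498.44, 485.56, 472.88, 460.40, 0.25)
print(round(y_1, 2), round(y_2, 2))
```

All three formulas integrate the cubic through the four derivative values, so they are exact whenever the derivative is a polynomial of degree three or less in t.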

The values of $\dot{x}$ and $\dot{y}$ found from (121.1), (121.2), and (121.3) are 
extremely accurate, but they can usually be slightly improved by iteration 
by means of formulas (121.4). The computation of a trajectory is started 
by the Barker method as follows: 

Close approximations to $\dot{x}_{-1}$, $\dot{x}_1$, $\dot{x}_2$ and $\dot{y}_{-1}$, $\dot{y}_1$, $\dot{y}_2$ are first found by means 
of (121.1) and (121.3), using $t = -\Delta t$, $t = \Delta t$, $t = 2\Delta t$. These $\dot{x}$'s and 
$\dot{y}$'s are then substituted in the given differential equations (120.1) to get $\ddot{x}_{-1}$, 
$\ddot{x}_1$, $\ddot{x}_2$ and $\ddot{y}_{-1}$, $\ddot{y}_1$, $\ddot{y}_2$. Then these $\ddot{x}$'s and $\ddot{y}$'s are substituted in the 
right-hand members of (121.4) to get improved values of $\dot{x}_{-1}$, $\dot{x}_1$, $\dot{x}_2$ and $\dot{y}_{-1}$, $\dot{y}_1$, 
$\dot{y}_2$. If there is any reason for thinking that these last values can be 
improved, the iteration process through (120.1) and (121.4) is repeated. 

When the final values of $\dot{y}_{-1}$, $\dot{y}_1$, and $\dot{y}_2$ have been found, they are 
substituted in the right-hand members of (121.4) to get the values of $y_{-1}$, 
$y_1$, $y_2$. If the values of $x_{-1}$, $x_1$, $x_2$ are desired, they can be found in a 
similar manner. 

When two or three sets of values of the several quantities mentioned 
above have been found by the process outlined, differences up to the second 
or third order will be available. The computation is then continued by 
means of the difference formulas (108.1) and (108.2). The application 
of the Barker method to a simple trajectory will be shown below. 

The actual computation of a modern high-altitude trajectory cannot be 
given here, because of lack of space for the necessary tables. The reader 
will find such a problem worked out completely in the Encyclopedia 
Britannica, Twelfth Edition, Vol. XXX, p. 390. A simple trajectory 
will be worked out below. 

Note. Since the advent of high-altitude rockets, the motion of projectiles 
in a vacuum has become of practical importance. A simple and direct 
treatment of this astro-ballistic problem will be found in the following 
paper: "The Actual Path of a Projectile in a Vacuum," American Journal 
of Physics, Vol. 13, No. 4 (August,), pp. 253-265. 

A thorough and masterly treatment of the motion of projectiles and 
rockets under all conditions will be found in Ballistics of the Future, by 
Kooy and Uytenbogaart. 

Example. A bullet is fired at an angle of 38° 30′ with the horizon and 
with an initial velocity of 780 feet per second. Assuming that the air 
resistance varies as the square of the velocity of the bullet and that the 
resistance coefficient is 0.00005, find the range, time of flight, and angle 
of fall of the bullet. 

Solution. Let $\theta$ denote the angle which the velocity vector makes with 
the horizontal at any instant. Then the equations of motion are 

$$\frac{d^2x}{dt^2} = -R\cos\theta = -0.00005v^2\cos\theta,$$

$$\frac{d^2y}{dt^2} = -R\sin\theta - g = -0.00005v^2\sin\theta - g,$$

where $R\,(= 0.00005v^2)$ denotes the tangential retardation. Since $v\cos\theta 
= v_x = dx/dt$ and $v\sin\theta = v_y = dy/dt$, the equations of motion can be 
written in the form 

$$\frac{d^2x}{dt^2} = -0.00005v\frac{dx}{dt},$$

$$\frac{d^2y}{dt^2} = -0.00005v\frac{dy}{dt} - g.$$

These can be reduced to a system of first-order equations by putting 

$$\frac{dx}{dt} = \dot{x}, \qquad \frac{dy}{dt} = \dot{y}, \qquad \frac{d^2x}{dt^2} = \frac{d\dot{x}}{dt} = \ddot{x}, \qquad \frac{d^2y}{dt^2} = \frac{d\dot{y}}{dt} = \ddot{y}.$$

Taking $g = 32.16$ ft./sec.², we then have the system 

$$\frac{dx}{dt} = \dot{x}, \qquad \frac{d\dot{x}}{dt} = \ddot{x} = -0.00005v\dot{x},$$

$$\frac{dy}{dt} = \dot{y}, \qquad \frac{d\dot{y}}{dt} = \ddot{y} = -0.00005v\dot{y} - 32.16.$$

To start the numerical solution of this system of equations we first 
find the initial values of the velocities and accelerations. Thus, 

$$v_0 = 780,$$
$$\dot{x}_0 = v_0\cos 38^\circ 30' = 610.44,$$
$$\dot{y}_0 = v_0\sin 38^\circ 30' = 485.56,$$
$$\ddot{x}_0 = -0.00005v_0\dot{x}_0 = -23.81,$$
$$\ddot{y}_0 = -0.00005v_0\dot{y}_0 - 32.16 = -51.10.$$
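As a rough cross-check on the first-order system above, it can be integrated with a classical fourth-order Runge-Kutta step and a small fixed interval. This is not the difference method used in the text; it is a swapped-in technique chosen only because it needs no starting values. The function names, the step h = 0.01 sec., and the linear interpolation to the instant y = 0 are all assumptions of this sketch.

```python
import math

K = 0.00005   # resistance coefficient of the example
G = 32.16     # acceleration of gravity, ft./sec.^2

def derivs(state, k=K, g=G):
    """Right-hand sides of the first-order system for (x, y, xdot, ydot)."""
    x, y, xd, yd = state
    v = math.hypot(xd, yd)
    return (xd, yd, -k * v * xd, -k * v * yd - g)

def rk4_step(state, h, k=K, g=G):
    """One classical Runge-Kutta step of length h."""
    def shifted(s, d, f):
        return tuple(a + f * b for a, b in zip(s, d))
    k1 = derivs(state, k, g)
    k2 = derivs(shifted(state, k1, h / 2), k, g)
    k3 = derivs(shifted(state, k2, h / 2), k, g)
    k4 = derivs(shifted(state, k3, h), k, g)
    return tuple(s + h / 6 * (a + 2 * b + 2 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

def fly(v0, elevation_deg, h=0.01, k=K, g=G):
    """Integrate until y returns to zero; return (range, time of flight)."""
    phi = math.radians(elevation_deg)
    state = (0.0, 0.0, v0 * math.cos(phi), v0 * math.sin(phi))
    t = 0.0
    while True:
        new = rk4_step(state, h, k, g)
        if new[1] < 0.0:
            # Linear interpolation inside the last step to the instant y = 0.
            frac = state[1] / (state[1] - new[1])
            return state[0] + frac * (new[0] - state[0]), t + frac * h
        state, t = new, t + h

print(fly(780.0, 38.5))
```

Setting k = 0 reproduces the vacuum trajectory, which is a useful sanity check on the signs of the two acceleration equations.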



These quantities give the first line in the table to be computed (page 388). 

To find additional starting values in this example by the Taylor-series 
method is out of the question, because of the difficulty in finding the higher 
derivatives of the given equations. But since the derivative of 
$\ddot{x} = -0.00005\,\dot{x}\sqrt{\dot{x}^2 + \dot{y}^2}$ is 

$$\dddot{x} = -0.00005\left[\ddot{x}\sqrt{\dot{x}^2 + \dot{y}^2} + \frac{\dot{x}(\dot{x}\ddot{x} + \dot{y}\ddot{y})}{\sqrt{\dot{x}^2 + \dot{y}^2}}\right],$$

with a similar expression for the derivative of $\ddot{y}$, the starting values could be found by the 
Milne method. However, for the purpose of illustrating the methods, we 
shall find the second and third lines of the table first by the modified 
Euler method and then by the Barker method. 

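(a). Modified Euler Method. To find the second line of the table on 
page 388 by the modified Euler method, we assume that the initial 
accelerations will remain practically constant for the next 1/4 second. Hence 

$$\Delta\dot{y} = \tfrac{1}{4}(-51.10) = -12.78,$$
$$\Delta\dot{x} = \tfrac{1}{4}(-23.81) = -5.95;$$

and therefore 

$$\dot{y}_{1/4}^{(1)} = \dot{y}_0 + \Delta\dot{y} = 485.56 - 12.78 = 472.78,$$
$$\dot{x}_{1/4}^{(1)} = \dot{x}_0 + \Delta\dot{x} = 610.44 - 5.95 = 604.49.$$

Since $v = \sqrt{\dot{x}^2 + \dot{y}^2}$, we have 

$$v_{1/4}^{(1)} = \sqrt{(604.49)^2 + (472.78)^2} = 767.42.$$

Then 

$$\ddot{y}_{1/4}^{(2)} = -0.00005 \times 767.42 \times 472.78 - 32.16 = -50.30,$$
$$\ddot{x}_{1/4}^{(2)} = -0.00005 \times 767.42 \times 604.49 = -23.19.$$

Better values for $\Delta\dot{y}$ and $\Delta\dot{x}$ are therefore 

$$\Delta\dot{y} = \frac{1}{4}\left(\frac{-51.10 - 50.30}{2}\right) = -12.68,$$
$$\Delta\dot{x} = \frac{1}{4}\left(\frac{-23.81 - 23.19}{2}\right) = -5.88.$$

$$\therefore\; \dot{y}_{1/4}^{(2)} = 485.56 - 12.68 = 472.88,$$
$$\dot{x}_{1/4}^{(2)} = 610.44 - 5.88 = 604.56,$$
$$v_{1/4}^{(2)} = \sqrt{(604.56)^2 + (472.88)^2} = 767.53.$$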



The third approximations for the accelerations at the end of the interval 
are then 

$$\ddot{y}_{1/4}^{(3)} = -50.31,$$
$$\ddot{x}_{1/4}^{(3)} = -23.20.$$

These values differ from the preceding values by only one unit in the 
last digit and will give the same values of $\dot{x}$ and $\dot{y}$ as before. Hence we 
take them as correct for the present. 

To find the value of $y$ when $t = \tfrac{1}{4}$, we have 

$$\Delta y = \frac{1}{4}\left(\frac{485.56 + 472.88}{2}\right) = 119.80,$$
$$y_{1/4} = 0 + 119.80 = 119.80.$$

The values are now known for the second line in the table. 

To find the third line we assume that the acceleration at the end of 
the first 1/4 second will remain practically constant for the next 1/4 second. 
Hence we have 

$$\Delta\dot{x} = \tfrac{1}{4}(-23.20) = -5.80,$$
$$\Delta\dot{y} = \tfrac{1}{4}(-50.31) = -12.58.$$

Then 

$$(\dot{x})_{1/2}^{(1)} = 604.56 - 5.80 = 598.76,$$
$$(\dot{y})_{1/2}^{(1)} = 472.88 - 12.58 = 460.30,$$
$$(v)_{1/2}^{(1)} = \sqrt{(598.76)^2 + (460.30)^2} = 755.24,$$
$$(\ddot{x})_{1/2}^{(2)} = -0.00005(598.76 \times 755.24) = -22.61,$$
$$(\ddot{y})_{1/2}^{(2)} = -0.00005(460.30 \times 755.24) - 32.16 = -49.54.$$

Better values for $\Delta\dot{x}$ and $\Delta\dot{y}$ are now 

$$(\Delta\dot{x})_{1/2}^{(2)} = \frac{1}{4}\left(\frac{-23.20 - 22.61}{2}\right) = -5.73,$$
$$(\Delta\dot{y})_{1/2}^{(2)} = \frac{1}{4}\left(\frac{-50.31 - 49.54}{2}\right) = -12.48.$$

Hence 

$$(\dot{x})_{1/2}^{(2)} = 604.56 - 5.73 = 598.83,$$
$$(\dot{y})_{1/2}^{(2)} = 472.88 - 12.48 = 460.40,$$
$$(v)_{1/2}^{(2)} = \sqrt{(598.83)^2 + (460.40)^2} = 755.36.$$

Then 

$$(\ddot{x})_{1/2}^{(3)} = -0.00005(598.83 \times 755.36) = -22.62,$$
$$(\ddot{y})_{1/2}^{(3)} = -0.00005(460.40 \times 755.36) - 32.16 = -49.55.$$



These values differ from the preceding values by only one unit in the last 
figure, and they give the same new values for $\Delta\dot{x}$ and $\Delta\dot{y}$. Hence we 
consider them correct for the present. The increment in $y$ for the second 
interval is 

$$\Delta y = \frac{1}{4}\left(\frac{472.88 + 460.40}{2}\right) = 116.66.$$

Hence the new value of $y$ is 

$$y_{1/2} = 119.80 + 116.66 = 236.46.$$
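The two modified-Euler steps worked above can be reproduced with a short routine. The names `accel` and `modified_euler_step`, and the stopping tolerance of 0.005 on successive velocity values, are assumptions of this sketch. Full floating-point precision is carried, so the results may differ from the hand computation by a unit in the second decimal place.

```python
import math

K = 0.00005   # resistance coefficient of the example
G = 32.16     # ft./sec.^2

def accel(xd, yd):
    """Accelerations from the equations of motion of the example."""
    v = math.hypot(xd, yd)
    return -K * v * xd, -K * v * yd - G

def modified_euler_step(xd, yd, h, tol=0.005, max_iter=20):
    """Advance the velocities one interval h by repeated averaging."""
    ax0, ay0 = accel(xd, yd)
    # First approximation: accelerations held constant over the interval.
    xd1, yd1 = xd + h * ax0, yd + h * ay0
    for _ in range(max_iter):
        ax1, ay1 = accel(xd1, yd1)
        # Better values: average of the accelerations at both ends.
        xd_new = xd + h * (ax0 + ax1) / 2.0
        yd_new = yd + h * (ay0 + ay1) / 2.0
        done = abs(xd_new - xd1) < tol and abs(yd_new - yd1) < tol
        xd1, yd1 = xd_new, yd_new
        if done:
            break
    return xd1, yd1

# Second and third lines of the table (t = 1/4 and t = 1/2).
xd_a, yd_a = modified_euler_step(610.44, 485.56, 0.25)
xd_b, yd_b = modified_euler_step(xd_a, yd_a, 0.25)
y_a = 0.25 * (485.56 + yd_a) / 2.0
y_b = y_a + 0.25 * (yd_a + yd_b) / 2.0
print(round(xd_a, 2), round(yd_a, 2), round(y_a, 2))
print(round(xd_b, 2), round(yd_b, 2), round(y_b, 2))
```

The loop stops as soon as a repetition of the averaging process no longer changes the velocities, which for this interval happens after two passes, in line with the remark in Art. 118.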

(b). Barker Method. To find the second and third lines of the table 
by the Barker method we first note that 

$$E_0 = 0.00005 \times 780 = 0.039$$

and 

$$\tan\phi = 0.79544.$$

Then on substituting $t = -\tfrac{1}{4}$ in (121.1) or (121.2) we get $\dot{x}_{-1} = 616.43$. 
Now substituting in (121.3) this value of $\dot{x}_{-1}$ and $t = -\tfrac{1}{4}$ we get 
$\dot{y}_{-1} = 498.41$. The value of $v_{-1}$ is then 

$$v_{-1} = \sqrt{(616.43)^2 + (498.41)^2} = 792.72.$$

These values are now substituted in the given equations to get the 
corresponding accelerations. Thus, 

$$\ddot{x}_{-1} = -\frac{792.72 \times 616.43}{20{,}000} = -24.43,$$

$$\ddot{y}_{-1} = -\frac{792.72 \times 498.41}{20{,}000} - 32.16 = -51.91.$$

These quantities are placed in the first line of the trial table (Table A, 
p. 385). The quantities in the third and fourth lines of Table A are found 
in exactly the same manner by substituting $t = \tfrac{1}{4}$ and $t = \tfrac{1}{2}$, respectively, 
in formulas (121.1) and (121.3) or in (121.2) and (121.3). 
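The trial values above can be checked with a short sketch of the Barker formulas. It uses the closed form (121.1) for the horizontal velocity and, for the vertical velocity, the unexpanded expression obtained by differentiating (4), of which (121.3) is the series form. The function name is hypothetical. Because the closed form uses the ratio of the tabulated initial velocities in place of the rounded value of tan φ, the last figure may differ slightly from the hand computation.

```python
def barker_velocities(t, xdot0, ydot0, E0, g=32.16):
    """Velocities on Barker's hyperbolic arc at time t (t may be negative)."""
    s = 1.0 + E0 * t / 3.0
    xdot = xdot0 / s ** 3                           # formula (121.1)
    a = 3.0 * g / (4.0 * E0)
    ydot = -a * s + (a + ydot0) / s ** 3            # closed form behind (121.3)
    return xdot, ydot

E0 = 0.00005 * 780.0
for t in (-0.25, 0.25, 0.5):
    xd, yd = barker_velocities(t, 610.44, 485.56, E0)
    print(t, round(xd, 2), round(yd, 2))
```

These values are only trial values; in the procedure of the text they are next refined through the differential equations and the quadrature formulas (121.4).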

We next check these computed trial values by substituting the 
accelerations in the right-hand members of (121.4). Thus, taking $\Delta t = \tfrac{1}{4}$, 

$$(\dot{x}_{-1})^{(1)} = 610.44 - \tfrac{1}{96}\left[9(-24.43) + 19(-23.81) - 5(-23.20) - 22.61\right] = 616.47,$$

$$(\dot{y}_{-1})^{(1)} = 485.56 - \tfrac{1}{96}\left[9(-51.91) + 19(-51.10) - 5(-50.30) - 49.54\right] = 498.44,$$



and similarly for the other values of $\dot{x}$ and $\dot{y}$. The corresponding $v$'s are 
then computed and then new accelerations are found from the given 
differential equations. 

It will be observed that the improved values in the second table (Table B) 
differ very little from the first computed values. A second iteration by 
formulas (121.4) makes no improvement whatever except in the case of $\dot{y}_{1/2}$, 
which it changes from 460.41 to 460.40. We therefore take the values 
in Table B to be correct. 

We are now ready to find $y$ for $t = \tfrac{1}{4}$ and $t = \tfrac{1}{2}$. These are found 
from (121.4) as follows: 

$$y_1 = 0 + \tfrac{1}{96}\left[-498.44 + 13(485.56 + 472.88) - 460.40\right] = 119.80 \text{ ft.}$$

$$y_2 = 0 + \tfrac{1}{12}(485.56 + 1891.52 + 460.40) = 236.46 \text{ ft.}$$

It will be noted that nearly all these values in Table B are identical 
with those found by the modified Euler method. 

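Now having three complete lines of the table, we can form first and 
second differences of the quantities $\dot{x}$, $\dot{y}$, $\ddot{x}$, $\ddot{y}$. Then we can continue the 
computation by integrating ahead and back-checking. For this purpose 
the following formulas are applied in the order in which they are written: 

$$\Delta\dot{y} = \int_{t_n}^{t_{n+1}} \ddot{y}\,dt = \Delta t\left[\ddot{y}_n + \tfrac{1}{2}\Delta_1\ddot{y}_n + \tfrac{5}{12}\Delta_2\ddot{y}_n + \tfrac{3}{8}\Delta_3\ddot{y}_n + \tfrac{251}{720}\Delta_4\ddot{y}_n\right], \tag{1}$$

for finding $\dot{y}$ in a new line; 

$$\Delta\dot{x} = \int_{t_n}^{t_{n+1}} \ddot{x}\,dt = \Delta t\left[\ddot{x}_n + \tfrac{1}{2}\Delta_1\ddot{x}_n + \tfrac{5}{12}\Delta_2\ddot{x}_n + \tfrac{3}{8}\Delta_3\ddot{x}_n + \tfrac{251}{720}\Delta_4\ddot{x}_n\right], \tag{2}$$

for finding $\dot{x}$ in the new line; 

$$v = \sqrt{\dot{x}^2 + \dot{y}^2}; \tag{3}$$

$$\ddot{y} = -0.00005v\dot{y} - 32.16; \tag{4}$$

$$\ddot{x} = -0.00005v\dot{x}; \tag{5}$$

$$\Delta\dot{y} = \int_{t_n}^{t_{n+1}} \ddot{y}\,dt = \Delta t\left[\ddot{y}_{n+1} - \tfrac{1}{2}\Delta_1\ddot{y}_{n+1} - \tfrac{1}{12}\Delta_2\ddot{y}_{n+1} - \tfrac{1}{24}\Delta_3\ddot{y}_{n+1}\right], \tag{6}$$

for checking and correcting the value of $\dot{y}$ found by (1); 

$$\Delta\dot{x} = \int_{t_n}^{t_{n+1}} \ddot{x}\,dt = \Delta t\left[\ddot{x}_{n+1} - \tfrac{1}{2}\Delta_1\ddot{x}_{n+1} - \tfrac{1}{12}\Delta_2\ddot{x}_{n+1} - \tfrac{1}{24}\Delta_3\ddot{x}_{n+1}\right], \tag{7}$$

for checking and correcting the value of $\dot{x}$ found by (2); 

$$\Delta y = \int_{t_n}^{t_{n+1}} \dot{y}\,dt = \Delta t\left[\dot{y}_{n+1} - \tfrac{1}{2}\Delta_1\dot{y}_{n+1} - \tfrac{1}{12}\Delta_2\dot{y}_{n+1} - \tfrac{1}{24}\Delta_3\dot{y}_{n+1}\right], \tag{8}$$

for finding the new $y$ after the correct value of $\dot{y}$ has been obtained. In 
these formulas the instant $t_{n+1}$ in (1) and (2) is the same as in (6), 
(7), and (8). 

The increments in $x$ for the several intervals can be found by means 
of the formula 

$$\Delta x = \int_{t_n}^{t_{n+1}} \dot{x}\,dt = \Delta t\left[\dot{x}_{n+1} - \tfrac{1}{2}\Delta_1\dot{x}_{n+1} - \tfrac{1}{12}\Delta_2\dot{x}_{n+1} - \tfrac{1}{24}\Delta_3\dot{x}_{n+1}\right] \tag{9}$$

after the correct value of $\dot{x}$ has been found for the interval considered. 
Since only the range is called for in this problem, however, it is not 
necessary to find $x$ at the end of each interval. The range is more easily 
found by means of Simpson’s Rule, as follows: 

$$x = \int_0^T \dot{x}\,dt = \frac{\Delta t}{3}\left[\dot{x}_0 + 4(\dot{x}_1 + \dot{x}_3 + \cdots + \dot{x}_{n-1}) + 2(\dot{x}_2 + \dot{x}_4 + \cdots + \dot{x}_{n-2}) + \dot{x}_n\right],$$

where $T$ denotes the time of flight. 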

The table is continued with the time interval $\Delta t = \tfrac{1}{4}$ sec. until five 
lines have been computed. 

The computed values of $\Delta\dot{y}$ and $\Delta\dot{x}$ are then checked by applying 
formulas (108.5), (108.4), and (108.3) to the acceleration values in the 
fifth line, checking the interval $t = 0$ to $t = 1/4$ by (108.4), the second 
interval by (108.5), etc. The checks show that all computed values are 
correct. 

Since the correct values are given at the first trial for $t = \tfrac{1}{2}$, $t = \tfrac{3}{4}$, and 
$t = 1$, we start a new table with $\Delta t = \tfrac{1}{2}$ sec., using the previously computed 
values of $\dot{x}$, $\dot{y}$, $\ddot{x}$, $\ddot{y}$, and $v$ for the lines $t = 0$, $t = \tfrac{1}{2}$, $t = 1$. Here, again, 
the correct values of the several quantities are given at the first trial in 
the fourth and fifth lines of the table. So we double the interval again 
and start a new table with $\Delta t = 1$ sec., using the previously computed 
values for lines $t = 0$, $t = 1$, $t = 2$. This new table is continued up to 
the line $t = 8$. Then the interval is doubled once more and a new table 
started. The computation is continued with this interval until the problem 
is finished. In most cases only one correction is necessary for $\ddot{x}$, $\ddot{y}$, and 
$v$, and none for $\dot{y}$ and $\dot{x}$. 

When using formulas (1), (2), (6), (7), (8), with $\Delta t = 2$ the student 
should not round off the numbers within the brackets before multiplying 
through by the factor 2; for by so doing he would double the error due to 
rounding. He should also be careful not to discard fractional quantities 
of less than half a unit in the second decimal place until he is sure that 
the algebraic sum of these quantities is less than half a unit in the second 
decimal place. Attention to these matters, instead of being a waste of time, 
will frequently save the time and labor of recomputing a whole line in the 
table. For example, let us check the value of $\Delta y$ in the line for $t = 26$. 
We have 

$$\Delta y = 2\left[-376.89 + \tfrac{1}{2}(47.27) - \tfrac{1}{12}(3.04) - \tfrac{1}{24}(0.15)\right]$$

$$= -753.78 + 47.27 - \tfrac{1}{6}(3.04) - \tfrac{1}{12}(0.15)$$

$$= -753.78 + 47.27 - 0.507 - 0.012 = -707.03.$$

By rounding off before multiplying by 2 we have 

$$\Delta y = 2[-376.89 + 23.64 - 0.25 - 0.01] = -707.02,$$

which differs from the previous value by a unit in the last figure. 
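The effect of rounding too early can be reproduced exactly with decimal arithmetic. The sketch below is only an illustration of the point just made; the helper name `r2` and the use of half-up rounding (the usual hand-computation rule) are assumptions.

```python
from decimal import Decimal, ROUND_HALF_UP

CENT = Decimal("0.01")

def r2(value):
    """Round to two decimal places, halves rounded away from zero."""
    return value.quantize(CENT, rounding=ROUND_HALF_UP)

# The four terms inside the brackets for the line t = 26.
terms = [Decimal("-376.89"),
         Decimal("47.27") / 2,
         -Decimal("3.04") / 12,
         -Decimal("0.15") / 24]

late = r2(2 * sum(terms))                      # multiply first, round last
early = 2 * sum(r2(term) for term in terms)    # round each term, then double

print(late, early)
```

Rounding each term first leaves an error of up to half a unit per term inside the brackets, and the factor 2 (or 4) then multiplies whatever error has accumulated.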

The preceding remarks apply with even greater force when At = 4. 

The final results of the computation for this problem are given on the 
following page. 

To find the time of flight we replace the terminal part of the trajectory 
by a parabola through the points corresponding to $t = 22$, $t = 24$, and 
$t = 26$. Hence $y$ is to be a quadratic function of $t$, and we find this 
function by constructing a table of differences and employing Newton’s 
interpolation formula (II) of Art. 21. 

     t        y        Δ₁y       Δ₂y 
    22     1389.97 
    24      780.54 
    26       73.61              -97.60 

Putting $y = 0$ in that formula, we have 

$$y_n + \Delta_1 y_n\,u + \frac{\Delta_2 y_n}{2}(u^2 + u) = 0.$$

$$\therefore\; 73.61 - 707.03u - 48.8(u^2 + u) = 0,$$

or 

$$48.8u^2 + 755.83u = 73.61.$$

$$\therefore\; u = \frac{-755.83 + 765.26}{97.6} = \frac{9.43}{97.6} = 0.0966,$$










[Table: final results of the trajectory computation, tabulated against $t$. The entries are not recoverable from the source.] 

and 

$$\therefore\; t = t_n + hu = 26 + 2 \times 0.0966 = 26.19 \text{ sec.}$$
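The interpolation for the time of flight amounts to solving a quadratic in u. A minimal sketch follows; the function name is hypothetical, the inputs are the last tabulated height with its first and second differences, and the root nearest u = 0 is the one wanted. Carrying the square root to full precision gives a value of u that differs slightly in the fourth decimal from the hand computation, with the same time of flight to two decimals.

```python
import math

def time_of_flight(t_n, h, y_n, d1, d2):
    """Solve y_n + d1*u + (d2/2)*(u*u + u) = 0 for the root nearest u = 0.

    Returns (t_n + h*u, u).  Assumes d2 is not zero.
    """
    a = d2 / 2.0
    b = d1 + d2 / 2.0
    c = y_n
    root = math.sqrt(b * b - 4.0 * a * c)
    candidates = [(-b + root) / (2.0 * a), (-b - root) / (2.0 * a)]
    u = min(candidates, key=abs)
    return t_n + h * u, u

T, u = time_of_flight(26.0, 2.0, 73.61, -707.03, -97.60)
print(round(T, 2), round(u, 4))
```

The same routine applies to any table in which the function changes sign just beyond the last computed line.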

We next compute the range by means of Simpson’s Rule. The horizontal 
distance covered during the first two seconds is, taking $h = \Delta t = \tfrac{1}{2}$ sec., 

$$x = \tfrac{1}{6}\left[610.44 + 4(598.84 + 577.32) + 2 \times 587.82 + 567.32\right] = 1176.3 \text{ ft.}$$

For the interval from $t = 2$ to $t = 26$, taking $h = 2$, we have 

$$x = \tfrac{2}{3}\left[567.32 + 4(531.69 + 476.21 + 434.09 + 399.07 + 366.90 + 335.36) + 2(501.77 + 453.93 + 416.00 + 382.81 + 351.12) + 319.58\right] = 10{,}181 \text{ ft.}$$

Hence the horizontal distance covered in the first 26 seconds is 10,181 + 1176 
= 11,357 ft. 
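The two applications of Simpson's Rule can be verified with a routine that works on tabulated ordinates. The function name `simpson_tabulated` is an assumption; the ordinates are the horizontal velocities quoted in the two sums above.

```python
def simpson_tabulated(ordinates, h):
    """Simpson's Rule for equally spaced ordinates (odd count, at least 3)."""
    n = len(ordinates) - 1
    if n < 2 or n % 2:
        raise ValueError("need an even number of intervals")
    odd = sum(ordinates[1:n:2])     # ordinates with weight 4
    even = sum(ordinates[2:n:2])    # interior ordinates with weight 2
    return h / 3.0 * (ordinates[0] + 4.0 * odd + 2.0 * even + ordinates[n])

# Horizontal velocities for t = 0, 1/2, 1, 3/2, 2 seconds.
first = [610.44, 598.84, 587.82, 577.32, 567.32]
# Horizontal velocities for t = 2, 4, ..., 26 seconds.
second = [567.32, 531.69, 501.77, 476.21, 453.93, 434.09, 416.00,
          399.07, 382.81, 366.90, 351.12, 335.36, 319.58]

x_first = simpson_tabulated(first, 0.5)
x_second = simpson_tabulated(second, 2.0)
print(round(x_first, 1), round(x_second), round(x_first + x_second))
```

Splitting the integral at t = 2 is necessary because Simpson's Rule requires a constant interval, and the tabular interval was doubled several times during the computation.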

To find the distance covered in the remaining 0.19 second we assume that 
the horizontal acceleration will remain at $-7.90$ for 0.19 sec. Then the 
change in velocity during this time will be $(-7.90) \times 0.19 = -1.5$. The 
horizontal velocity at the end of 26.19 seconds will therefore be $319.6 - 1.5$ 
or 318.1 ft./sec., and the average velocity during this fraction of a second 
is $(319.6 + 318.1)/2 = 318.8$ ft./sec. Hence the horizontal distance covered 
in the last 0.19 sec. is $318.8 \times 0.19 = 61$ ft. The total range is therefore 

$$x = 11357 + 61 = 11{,}418 \text{ ft.}$$


If we compute the increments in $x$ for the several time intervals and add 
them as we go along, as was done in the case of $y$, we shall find the same 
value for the range as found by Simpson’s Rule. 

If $\omega$ denote the angle of fall, then 

$$\tan\omega = \frac{\dot{y}}{\dot{x}}.$$

We have already found the value of $\dot{x}$ for $t = 26.19$. To find $\dot{y}$ we assume 
that the second difference in $\ddot{y}$ will be the same for the interval $t = 26$ 
to $t = 28$ as for the preceding interval. Then for the next two seconds 
we shall have $\Delta_1\ddot{y} = 1.64$. Hence for one second the change in $\ddot{y}$ will be 
0.82, and for 0.19 second it will be $0.82 \times 0.19 = 0.16$. The vertical 
acceleration when $t = 26.19$ will therefore be $-22.85 + 0.16 = -22.69$. 
The change in the vertical velocity during the last 0.19 second is then 

$$\frac{-22.85 - 22.69}{2} \times 0.19 = -4.3.$$



390 


SOLUTION OF ORDINARY DIFFERENTIAL EQUATIONS IChap. XIII 


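Hence $\dot y = -376.9 - 4.3 = -381.2$,

$$\tan\omega = \frac{-381.2}{318.1} = -1.198,$$

and $\omega = \arctan(-1.198) \approx -50^\circ$.

The terminal velocity is

$$v = \sqrt{\dot x^2 + \dot y^2} = \sqrt{(318.1)^2 + (381.2)^2} = 496.6 \text{ ft./sec.}$$

The value given by formula (119.1) is also 496.6.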


EXERCISE XV

The motion of a bullet is determined by the equations

$$\frac{d^2x}{dt^2} = -0.000035\,v\,\frac{dx}{dt},$$

$$\frac{d^2y}{dt^2} = -0.000035\,v\,\frac{dy}{dt} - 32.16.$$

If the initial conditions are $v = 800$ ft./sec., $dx/dt = 692.82$ ft./sec., $dy/dt = 400$ ft./sec. when t = 0, find the horizontal range and time of flight of the bullet.



CHAPTER XIV 


THE NUMERICAL SOLUTION OF PARTIAL 
DIFFERENTIAL EQUATIONS 

122. Introduction. One of the greatest needs in applied mathematics is a general and reasonably short method of solving partial differential equations by numerical methods. Several methods have been proposed for meeting this need, but none can be called entirely satisfactory. They are all long and laborious.

Soon after Runge discovered his method of solving ordinary differential equations, Gans¹ extended the method to partial differential equations with given initial conditions. Some years later, Willers² extended the improved Runge-Kutta method to the solution of partial differential equations with given initial conditions. These methods are slow and laborious and have not come into general use.

Certain types of boundary-value problems can be solved by replacing the differential equation by the corresponding difference equation and then solving the latter by a process of iteration. This method of solving partial differential equations was devised and first used by L. F. Richardson.³ It was later improved by H. Liebmann⁴ and further improved more recently by Shortley and Weller.⁵ The process is slow, but gives good results on boundary-value problems which satisfy Laplace's, Poisson's, and several other partial differential equations. A strong point in its favor is that the computation can be done by an automatic sequence-controlled calculating machine.

A somewhat similar method is the relaxation method devised by R. V. Southwell.⁶ This method is shorter and more flexible than the iteration method, but is not adapted to automatic machine computation. In both of these methods the approximate solution of a partial differential equation, with given boundary values, is found by finding the solution of the corresponding partial difference equation.


301 



392 


SOLUTION OF PARTIAL DIFFERENTIAL EQUATIONS [Chap. XIV 


I. DIFFERENCE QUOTIENTS AND DIFFERENCE EQUATIONS. 


123. Difference Quotients. A difference quotient is the quotient obtained by dividing the difference between two values of a function by the difference between the two corresponding values of the independent variable. Thus for a function f(x) of a single variable the difference quotient is the familiar expression

$$\frac{f(x+h) - f(x)}{h},$$

whose limiting value is the derivative of f(x) with respect to x. A difference quotient is thus an approximation to the derivative, the approximation becoming closer as h becomes smaller.


Partial-difference quotients of the second and higher orders are best 
constructed with reference to a network of points in the xy-plane for a 
function of two variables and in space for a function of three variables. 
For a function u{x,y) of two variables, let the xy-plane be divided into a 
network or lattice of squares of side h, by drawing the two families of 
parallel lines 

$$x = mh, \quad m = 0, 1, 2, \ldots$$

$$y = nh, \quad n = 0, 1, 2, \ldots$$


as indicated in Fig. 18. The points of intersection of these families of 
lines are called lattice points. 

With reference to Fig. 18, the forward first-difference quotient of u(x, y) with respect to x is

$$u_x = \frac{u(x+h,y) - u(x,y)}{h} \tag{123.1}$$

and the backward first-difference quotient with respect to x is

$$u_{\bar x} = \frac{u(x,y) - u(x-h,y)}{h}. \tag{123.2}$$

The second-difference quotient of u(x, y) with respect to x is the difference quotient of the first-difference quotients (123.1) and (123.2). Hence we have

$$u_{x\bar x} = \frac{u_x - u_{\bar x}}{h} = \frac{\dfrac{u(x+h,y) - u(x,y)}{h} - \dfrac{u(x,y) - u(x-h,y)}{h}}{h} = \frac{u(x+h,y) - 2u(x,y) + u(x-h,y)}{h^2}. \tag{123.3}$$


Abt. 123] 


DIFFERENCE QUOTIENTS 


398 


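The first- and second-difference quotients of u(x, y) with respect to y are found in exactly the same manner and are

$$u_y = \frac{u(x,y+h) - u(x,y)}{h}, \qquad u_{\bar y} = \frac{u(x,y) - u(x,y-h)}{h} \tag{123.4}$$

and

$$u_{y\bar y} = \frac{u(x,y+h) - 2u(x,y) + u(x,y-h)}{h^2}. \tag{123.5}$$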













Fig. 18. Square lattice of side h about the point (x, y), showing the neighboring lattice points (x ± h, y), (x ± 2h, y), (x, y ± h), and (x, y ± 2h).


Higher difference quotients are found in exactly the same manner except 
that additional lattice points must be used. 

Difference quotients of a function u(x, y, z) of three variables are found by the same process as used above. Thus for the second-difference quotient with respect to z we have

$$u_{z\bar z} = \frac{u(x,y,z+h) - 2u(x,y,z) + u(x,y,z-h)}{h^2}. \tag{123.6}$$




The reader will note that the differences used in finding all these difference quotients are central differences. When central differences are used, the inherent error made by replacing a second derivative by a second-difference quotient is proportional to $h^2$ if h is small. This fact can be shown by replacing the terms u(x + h, y) and u(x − h, y) in (123.3) by their Taylor expansions and then comparing $u_{x\bar x}$ with $\partial^2 u/\partial x^2$.
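The $h^2$ behavior of the error can also be observed numerically. The following sketch uses $f(x) = \sin x$ as an assumed test function (not taken from the text) and compares the second-difference quotient with the exact second derivative for two mesh sizes; halving h should divide the error by about 4.

```python
import math

def second_difference(f, x, h):
    """Central second-difference quotient [f(x+h) - 2 f(x) + f(x-h)] / h^2."""
    return (f(x + h) - 2.0 * f(x) + f(x - h)) / (h * h)

def error(h, x=1.0):
    # For f = sin the exact second derivative is -sin(x).
    return abs(second_difference(math.sin, x, h) - (-math.sin(x)))

ratio = error(0.1) / error(0.05)   # expected to be close to (0.1 / 0.05)^2 = 4
print(ratio)
```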


124. Difference Equations. The difference equation corresponding to a given differential equation is found by replacing the derivatives by the corresponding difference quotients. The functions u(x, y) and u(x, y, z) occurring in the difference equations are defined only at the lattice points,
but we can make these points as close together as desired by decreasing h. 
In order to get simple procedures for solving the difference equations we 
shall assume that the given differential equation is exactly satisfied by the 
difference quotients. The magnitude of the inherent error resulting from 
this assumption will be investigated later (Art. 126). The difference 
equations corresponding to several well-known partial differential equations 
are given below. 

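(a) Laplace's equation for two dimensions,

$$\frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2} = 0.$$

Replacing $\partial^2 u/\partial x^2$ and $\partial^2 u/\partial y^2$ by $u_{x\bar x}$ and $u_{y\bar y}$, respectively, we get

$$\frac{u(x+h,y) - 2u(x,y) + u(x-h,y)}{h^2} + \frac{u(x,y+h) - 2u(x,y) + u(x,y-h)}{h^2} = 0,$$

from which

$$u(x,y) = \tfrac{1}{4}\,[u(x+h,y) + u(x,y+h) + u(x-h,y) + u(x,y-h)]. \tag{124.1}$$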

This equation shows that the value of u at any interior lattice point is 
the arithmetic mean of the values of u at the four lattice points nearest it. 
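As a quick illustration of (124.1), the four-point mean can be compared with known solutions of Laplace's equation. The two test functions below, $x^2 - y^2$ and $e^x \cos y$, are assumed examples chosen because they are harmonic; the helper name `four_point_mean` is hypothetical.

```python
import math

def four_point_mean(u, x, y, h):
    """Right-hand side of (124.1): mean of u at the four nearest lattice points."""
    return 0.25 * (u(x + h, y) + u(x, y + h) + u(x - h, y) + u(x, y - h))

def quadratic(x, y):
    return x * x - y * y              # harmonic; (124.1) reproduces it exactly

def exponential(x, y):
    return math.exp(x) * math.cos(y)  # harmonic; (124.1) holds only approximately

gap_quadratic = four_point_mean(quadratic, 1.5, 0.5, 0.1) - quadratic(1.5, 0.5)
gap_exponential = four_point_mean(exponential, 0.0, 0.0, 0.1) - exponential(0.0, 0.0)
print(gap_quadratic, gap_exponential)
```

The second gap is small but not zero, which is the inherent error of replacing the differential equation by the difference equation.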


(b) Laplace's equation for three dimensions,

$$\frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2} + \frac{\partial^2 u}{\partial z^2} = 0.$$

Abt. 124 ] 


DIFFERENCE EQUATIONS 


896 


Replacing the second derivatives by the second-difference quotients as given by (123.3), (123.5), and (123.6), and solving for u(x, y, z), we get

$$u(x,y,z) = \tfrac{1}{6}\,[u(x+h,y,z) + u(x,y+h,z) + u(x,y,z+h) + u(x-h,y,z) + u(x,y-h,z) + u(x,y,z-h)]. \tag{2}$$

This equation (2) shows that the value of u at any interior lattice point 
in space is the arithmetic mean of the values of u at the six lattice points 
nearest it. 


(c) Poisson's equation in two dimensions,

$$\frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2} = -4\pi\rho(x,y).$$

Replacing $\partial^2 u/\partial x^2$ and $\partial^2 u/\partial y^2$ by $u_{x\bar x}$ and $u_{y\bar y}$, respectively, as given by (123.3) and (123.5), we get

$$u(x,y) = \tfrac{1}{4}\,[u(x+h,y) + u(x,y+h) + u(x-h,y) + u(x,y-h)] + \pi h^2 \rho(x,y). \tag{124.3}$$

Here the value of u at an interior lattice point depends not only on the u's of the adjacent points but also explicitly on the value of h and the function $\rho(x, y)$.

(d) The equation of heat conduction in a plane,

$$\frac{\partial T}{\partial t} = a^2\left(\frac{\partial^2 T}{\partial x^2} + \frac{\partial^2 T}{\partial y^2}\right).$$

Here t denotes time and T denotes temperature at any time and place, or T = T(x, y, t), and $a^2$ is a constant. If the temperature of the plane area has reached a steady state, so that $\partial T/\partial t = 0$, then the above equation reduces to Laplace's equation.

If the steady state has not been reached, the temperature at any point depends on the time. Hence the difference quotient at any lattice point at time t is

$$T_t = \frac{T(x,y,t+\Delta t) - T(x,y,t)}{\Delta t}.$$

The second-difference quotients $T_{x\bar x}$ and $T_{y\bar y}$ at any instant (t fixed) are given by (123.3) and (123.5) if u is replaced by T. Hence on replacing the derivatives in the heat equation by the corresponding difference quotients, we get

$$\frac{T(x,y,t+\Delta t) - T(x,y,t)}{\Delta t} = a^2\left[\frac{T(x+h,y,t) - 2T(x,y,t) + T(x-h,y,t)}{h^2} + \frac{T(x,y+h,t) - 2T(x,y,t) + T(x,y-h,t)}{h^2}\right],$$

from which

$$T(x,y,t+\Delta t) = T(x,y,t) + \frac{a^2\,\Delta t}{h^2}\,[T(x+h,y,t) + T(x,y+h,t) + T(x-h,y,t) + T(x,y-h,t) - 4T(x,y,t)].$$

Since $\Delta t$ is an arbitrary increment of time, we may set $\Delta t = h^2/(4a^2)$. Then the above equation reduces to

$$T(x,y,t+\Delta t) = \tfrac{1}{4}\,[T(x+h,y,t) + T(x,y+h,t) + T(x-h,y,t) + T(x,y-h,t)]. \tag{4}$$


This equation gives the temperature at any interior lattice point at time 
t -|- At as the arithmetic mean of the temperatures of the four adjacent 
lattice points at time t. If the temperature has reached the steady state, 
we have 


$$T(x,y,t) = \tfrac{1}{4}\,[T(x+h,y,t) + T(x,y+h,t) + T(x-h,y,t) + T(x,y-h,t)]. \tag{124.5}$$

Other types of partial differential equations can be replaced by partial 
difference equations by proceeding as in the above examples. 
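The explicit heat-conduction step of example (d) can be sketched as follows. The grid layout (a list of rows with fixed boundary values) and the names `heat_step` and `r` are assumptions made for illustration; `r` stands for the ratio $a^2 \Delta t / h^2$, and with `r = 0.25` the step reduces to the four-point mean of equation (4).

```python
def heat_step(T, r):
    """Advance the interior temperatures one time step.

    T is a list of rows; the outer rows and columns hold boundary values and are
    left unchanged. r = a^2 * dt / h^2 (r = 0.25 gives the plain four-point mean).
    """
    rows, cols = len(T), len(T[0])
    new = [row[:] for row in T]          # copy so every update uses values at time t
    for i in range(1, rows - 1):
        for j in range(1, cols - 1):
            neighbors = T[i][j + 1] + T[i - 1][j] + T[i][j - 1] + T[i + 1][j]
            new[i][j] = T[i][j] + r * (neighbors - 4.0 * T[i][j])
    return new

# Hypothetical 3 x 3 plate: top edge held at 100, other edges at 0, cold interior.
plate = [[100.0, 100.0, 100.0],
         [0.0,   0.0,   0.0],
         [0.0,   0.0,   0.0]]
after_one_step = heat_step(plate, 0.25)
print(after_one_step[1][1])
```

Unlike the iteration process described later, every new value here is computed from the old time level only, which is why the grid is copied first.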


II. THE METHOD OF ITERATION.

125. Solution of Difference Equations by Iteration. We consider a process of solving Laplace's equation in two variables and with given boundary conditions. For simplicity we assume that the function u(x, y) is required over a rectangular area. We therefore cover the area with a network of squares of side h, as shown in Fig. 19.

Since the boundary values of the desired function are assumed to be known, we denote them by a's, as indicated in Fig. 19. The values of the



Abt. 126] 


SOLUTION BY ITERATION 


897 


required function at the interior lattice points are unknown, but in order to start the iteration process by equation (124.1) we compute rough values for them as shown in the solution of Example 1 below.

We start the iteration process by computing an improved value of $u_1$ by means of formula (124.1), the new or improved value of $u_1$ being denoted by $u'_1$. Then we proceed to improve $u_2$ in the same manner, and so on with all the other interior lattice points. The traverse, as it is called, proceeds over the network in the order in which the points are numbered. Thus we have

v^i — ^(tts "i" o* "t" ®»4 "I* Wt) 
w'l = i(«i + at + + «t) 


— i(a« ■(” ti\ -i" Oti ~1~ Uii) 
ii\ = *4" **% “f* Wit) 


1*%4 = i(ai2 + 1*^12 + 1*^2 + 0 x 4 ). 


Note that improved values are used as soon as available in computing 
improved values for points ahead. 
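A traverse of this kind can be sketched in a few lines. The representation below (a list of rows whose outer ring holds the boundary values) and the names `traverse` and `iterate` are assumptions for illustration, not the book's computing schedule; the test grid uses the boundary values of the plane u = x + y, an assumed example for which the difference equation is satisfied exactly.

```python
def traverse(u):
    """One traverse in reading order; improved values are used as soon as available.

    Updates u in place and returns the largest change made at any interior point.
    """
    largest = 0.0
    for i in range(1, len(u) - 1):
        for j in range(1, len(u[0]) - 1):
            new = 0.25 * (u[i][j + 1] + u[i - 1][j] + u[i][j - 1] + u[i + 1][j])
            largest = max(largest, abs(new - u[i][j]))
            u[i][j] = new            # overwrite at once, so later points see it
    return largest

def iterate(u, tol=1e-6, max_traverses=10_000):
    """Repeat traverses until no interior value changes by as much as tol."""
    for count in range(1, max_traverses + 1):
        if traverse(u) < tol:
            return count
    return max_traverses

# Boundary values from u = x + y on a 5 x 5 lattice; interior started at zero.
grid = [[float(i + j) if i in (0, 4) or j in (0, 4) else 0.0 for j in range(5)]
        for i in range(5)]
traverses_used = iterate(grid)
print(traverses_used, grid[2][2])
```

Stopping when the largest change falls below a tolerance mirrors the rule of repeating the process "as long as it produces any improvement".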





The process outlined above is repeated as long as it produces any improvement in the u's. For the second traverse we would start with

u"i = 4- Oa -I- Ou -f u'j), 

etc. 

In solving partial-difference equations by the iteration process, it is advisable to start with a coarse net (large value of h). Then when iteration gives no further improvement in the u's, the whole process is repeated with a finer net (smaller value of h) and the iteration carried on until no change occurs in the u's.

Example 1. Solve the Laplace difference equation for a square region 
and having the boundary values shown in Fig. 20. 

Solution. We start with a coarse net by dividing the given square into 16 
smaller squares, as shown in the figure. 

To get initial values for the interior points of the network, we first find a value for $u_5$ at the center of the square by taking the mean of the four boundary values at the ends of the heavy lines drawn at right angles through the center. Then we find the values for the centers of the four large squares into which the given square is divided by the heavy lines through the center. These values are found by taking the means of the values at the ends of the diagonals of the large squares.* The values for
the four remaining interior points lying on the heavy central lines are 
found by taking the means of the four adjacent points in each case ; that is, 
the four nearest points lying on the horizontal and vertical lines through 
the points considered. The computation of some of the interior values is 
shown at the bottom of Fig. 20. 

Now having all the boundary values for the network and rough values 
for the interior lattice points, we are ready to start with the iteration 
process. Beginning with the first interior point in the upper left-hand 
corner of the square, we proceed to the right until the last interior point 
on the line is improved. Then we drop down to the next line and proceed 


* This method of taking the mean of the values of the function at the ends of the diagonals of a square is perfectly legitimate, because if we make a transformation of coordinates by rotating the x- and y-axes through 45°, the ends of the diagonals are on the new coordinate axes; and from the transformation equations $x = \frac{1}{\sqrt 2}(x' - y')$, $y = \frac{1}{\sqrt 2}(x' + y')$ we find $\frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2} = \frac{\partial^2 u}{\partial x'^2} + \frac{\partial^2 u}{\partial y'^2}$. (See E. B. Wilson's Advanced Calculus, p. 112, Ex. 25, or Vallée-Poussin's Cours d'Analyse Infinitésimale, I (4th Edition), p. 169, Ex. 6.) Hence u satisfies Laplace's equation in the new coordinates, and Equation (124.1) is therefore valid here.






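Fig. 20. Computation of initial values at the interior points:

$u_1 = \tfrac{1}{4}(12.5 + 17.0 + 0 + 0) = 7.4$
$u_3 = \tfrac{1}{4}(18.6 + 17.0 + 12.5 + 21.0) = 17.3$
$u_9 = \tfrac{1}{4}(21.0 + 12.5 + 12.1 + 9.0) = 13.6$
$u_2 = \tfrac{1}{4}(17.3 + 17.0 + 7.4 + 12.5) = 13.6$
$u_4 = \tfrac{1}{4}(12.5 + 7.4 + 0 + 6.2) = 6.5$
$u_6 = \tfrac{1}{4}(21.0 + 17.3 + 12.5 + 13.6) = 16.1$
$u_8 = \tfrac{1}{4}(13.6 + 12.5 + 6.2 + 12.1) = 11.1$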




from left to right with the interior points in that line and so on, the order 
being the same as that followed in reading the lines of a printed page. 

The first improved value in the top line of interior points is found from formula (124.1) to be

$$u'_1 = \tfrac{1}{4}(13.6 + 11.1 + 0 + 6.5) = 7.8.$$

The improved value for the next lattice point to the right is

$$u'_2 = \tfrac{1}{4}(17.3 + 17.0 + 7.8 + 12.5) = 13.65,$$

which, for the sake of simplicity, we round off to 13.6.

The process is repeated until it produces no change in the values of the 
interior points. 

The next step in the computation is to halve the value of h, and repeat the iteration process with the smaller squares. The initial values for the new mesh points are computed just as in the previous case, by taking the means of the values of the corners to get the values at the centers of the previous larger squares and then finding the remaining values by taking the means of the nearest points on the horizontal and vertical lines through the point considered. Thus, for the value at the center of the first of the previous squares we have

$$u_1 = \tfrac{1}{4}(7.8 + 11.1 + 0 + 0) = 4.7.$$

For the value at the center of the next old square we have

$$u_3 = \tfrac{1}{4}(13.6 + 17.0 + 11.1 + 7.8) = 12.4.$$

Then for $u_2$ of the new mesh points we have

$$u_2 = \tfrac{1}{4}(12.4 + 11.1 + 4.7 + 7.8) = 9.0.$$

The remaining initial values for the new network are found in like manner.

Having initial values for all the lattice points of the new net, the computer then begins the iteration process for the new network. The first improved value for the first interior mesh point in the upper left-hand corner is (see Fig. 21)

$$u'_1 = \tfrac{1}{4}(9.0 + 7.7 + 0 + 4.0) = 5.2.$$

Two applications of the iteration process suffice to complete the solution 
for the new value of h. If desired, one could halve h again and get a 
closer approximation for the solution of the given example. For this new 
computation with a still smaller h, additional intermediate boundary values 
would have to be interpolated, estimated, or scaled from a curve plotted 





from the given boundary values of Fig. 20. The computed interior points 
will then be no more accurate than the new boundary points. 



Fig. 21

Remark. The reader should keep in mind the fact that the computed 
values at the mesh points in a network are determined by two things : 

1). The given differential equation.

2). The set of given boundary values.

Hence if the boundary values are known to only two or three significant 
figures, it is useless to compute the interior points to more figures. 






Example 2. Solve the Laplace difference equation for a square region having the boundary equations shown in Fig. 24.*

Solution. Here the value of u on the boundaries is given by definite analytic expressions which may be evaluated at any point on the boundary to any desired degree of accuracy. Such boundaries also make it possible to express u analytically as a Fourier series and thus enable us to compare the numerical solution with the Fourier solution.

We start with a coarse net by dividing the given square region into 16 squares. Approximate values of u at the mesh points are found as already explained in the preceding example. Five applications of the iteration process gave the results shown in Fig. 22.

* The given boundary values in this example are quantities of zero dimensions, or pure numbers. Hence the computed values at all interior mesh points will likewise be pure numbers.

We then halve the value of h and make a new computation, as indicated 
in Fig. 23. The initial values at the new interior mesh points are found 



Fig. 23

as explained in the previous example. All initial values are written above 
the corresponding mesh points. The final values for the interior mesh 
points are written below the points. The iteration process was applied 
20 times to get these final values. By noting the time required for one 
traverse, it is an easy matter to estimate the time required for solving 
this problem. 

Additional values of the function in the region of the network can be 
found by the following procedure: 






Find the values at the centers of the squares by applying formula (124. 1) 
to the final values just obtained for the net points. Then find the value 
of the function at the midpoints of the sides of the squares by applying 
(124. 1) to the new center points and the final values at the mesh points. 


Fig. 24

The last points to reach their stationary values were those near the 
center of the region, and it may therefore be fairly assumed that the 
values at these points are the least accurate of all; that is, the differences
between these computed values and the true values of the function are 
greatest for these points. On the other hand, a Fourier series for the 





Art. 126 ] 


INHERENT ERROR 


406 


function gives its most accurate values for these points near the center. 
Hence the difference between the given computed value and the value given 
by a Fourier series for points near the center of this region will be very 
close to the true error of the computed values for those points. 

The Fourier series for $u_{2,2}$, or for the point x = 2, y = 2, is a rapidly converging alternating series whose value is 6.9535 correct to four decimal places. The approximate error in the difference equation is thus about 6.9535 − 6.920, or about 0.03.

Figure 24 represents the surface u = f{x, y) whose ordinates have been 
computed in this example. 


126. The Inherent Error in the Solution by Difference Equations. 
The inherent error in the difference-equation solution of a differential 
equation can be found by expressing the difference quotients in terms of 
derivatives, and this can be done by means of Taylor’s formula. 

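Taylor's formula for a function of two variables can be written symbolically in the form

$$f(x+h, y+k) = f(x,y) + \left(h\frac{\partial}{\partial x} + k\frac{\partial}{\partial y}\right) f(x,y) + \frac{1}{2!}\left(h\frac{\partial}{\partial x} + k\frac{\partial}{\partial y}\right)^2 f(x,y) + \cdots \tag{1}$$

When k = 0, this becomes

$$f(x+h,y) = f(x,y) + h\frac{\partial f}{\partial x} + \frac{h^2}{2!}\frac{\partial^2 f}{\partial x^2} + \frac{h^3}{3!}\frac{\partial^3 f}{\partial x^3} + \frac{h^4}{4!}\frac{\partial^4 f}{\partial x^4} + \cdots,$$

from which

$$\frac{f(x+h,y) - f(x,y)}{h} = \frac{\partial f}{\partial x} + \frac{h}{2!}\frac{\partial^2 f}{\partial x^2} + \frac{h^2}{3!}\frac{\partial^3 f}{\partial x^3} + \frac{h^3}{4!}\frac{\partial^4 f}{\partial x^4} + \cdots \tag{2}$$

Changing h to −h in (2), we get

$$\frac{f(x,y) - f(x-h,y)}{h} = \frac{\partial f}{\partial x} - \frac{h}{2!}\frac{\partial^2 f}{\partial x^2} + \frac{h^2}{3!}\frac{\partial^3 f}{\partial x^3} - \frac{h^3}{4!}\frac{\partial^4 f}{\partial x^4} + \cdots \tag{3}$$

Forming the second-difference quotient by subtracting (3) from (2) and then dividing throughout by h, we have

$$\frac{f(x+h,y) - 2f(x,y) + f(x-h,y)}{h^2} = \frac{\partial^2 f}{\partial x^2} + \frac{2h^2}{4!}\frac{\partial^4 f}{\partial x^4} + \text{terms in } h^4, h^6, \text{ etc.} \tag{4}$$

Likewise, on putting h = 0 in (1) and proceeding exactly as above, we get

$$\frac{f(x,y+k) - 2f(x,y) + f(x,y-k)}{k^2} = \frac{\partial^2 f}{\partial y^2} + \frac{2k^2}{4!}\frac{\partial^4 f}{\partial y^4} + \text{terms in } k^4, k^6, \text{ etc.} \tag{5}$$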





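Putting k = h for a square net, using the notation of Art. 123 for second differences, and then adding (4) and (5), we get

$$u_{x\bar x} + u_{y\bar y} = \frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2} + \frac{2h^2}{4!}\frac{\partial^4 u}{\partial x^4} + \frac{2h^2}{4!}\frac{\partial^4 u}{\partial y^4} + \text{terms in } h^4, h^6, \text{ etc.}$$

$$= 0 + \frac{2h^2}{4!}\left(\frac{\partial^4 u}{\partial x^4} + \frac{\partial^4 u}{\partial y^4}\right) + \text{terms in } h^4, h^6, \text{ etc.}; \tag{6}$$

since $\frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2} = 0$ by hypothesis. The error committed in writing $u_{x\bar x} + u_{y\bar y} = 0$ is thus a power series in even powers of h, the principal part of the error being the first term of this series.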

Now if we solve the equation $u_{x\bar x} + u_{y\bar y} = 0$ and thereby neglect the error terms, the error in the solution will be the integral of these error terms; and since h is independent of x and y (has a fixed value throughout the integration), the integral of the error terms will be a power series in $h^2$, $h^4$, etc. Hence when h is small the principal error in the solution will be the first or $h^2$ term. We are therefore justified in assuming that the inherent error in the difference-equation solution of a partial differential equation of the second order is proportional to $h^2$.

To find a simple formula for the inherent error, we have by hypothesis

$$E = ch^2,$$

where E denotes the error and c is a constant of proportionality. Then for any two values $h_1$ and $h_2$ of h, the corresponding errors are $E_1 = ch_1^2$ and $E_2 = ch_2^2$, from which $E_2/E_1 = h_2^2/h_1^2$, or $E_2 = (h_2/h_1)^2 E_1$. If $h_2 = \tfrac{1}{2}h_1$, then

$$E_2 = \tfrac{1}{4}E_1. \tag{7}$$

Let $a_1$ and $a_2$ denote the final approximate values of the function u at any interior mesh point, corresponding to $h_1$ and $h_2$ respectively. Then

$$u = a_1 + E_1, \qquad u = a_2 + E_2.$$

Eliminating u and taking account of (7), we get

$$E_2 = \tfrac{1}{3}(a_2 - a_1). \tag{8}$$

This formula gives the approximate value of the inherent error at each intersection point of the network after two values of h have been used, the second value of h being half the first value.

Since $u = a_2 + E_2$, we can substitute the value of $E_2$ from (8) and get

$$u = a_2 + \tfrac{1}{3}(a_2 - a_1), \tag{9}$$

which gives a close approximation to the true value of u at any net point.



Abt. 1271 APPLICATION OF CONFORMAL TRANSFORMATION 


407 


As an application of formulas (8) and (9), let us consider the error in u at the point x = 2, y = 2 of Example 2, Art. 125. There $a_1 = 6.836$, $a_2 = 6.920$. Hence

$$E_2 = \tfrac{1}{3}(6.920 - 6.836) = 0.028,$$
$$u = 6.920 + 0.028 = 6.948.$$

This value agrees closely with the extremely accurate value given by Fourier's series.
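Formulas (8) and (9) are easily mechanized. In the sketch below the function name `corrected_value` is hypothetical; it assumes, as in the text, that the second mesh size is exactly half the first.

```python
def corrected_value(a1, a2):
    """Apply (8) and (9): a1 from mesh size h, a2 from mesh size h/2.

    Returns (E2, u), the estimated inherent error of a2 and the corrected value.
    """
    e2 = (a2 - a1) / 3.0
    return e2, a2 + e2

# Values at the center point of Example 2, Art. 125, as quoted in the text.
e2, u = corrected_value(6.836, 6.920)
print(round(e2, 3), round(u, 3))
```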

Note. Although the Fourier series solution gives very accurate values 
near the center of the region of Example 2 , it gives very poor values on 
and near the boundaries even when 12 terms of the series are used. On 
the whole, the solution by difference equations is preferable in that example 
and is obtained by much less work. 

127. Application of Conformal Transformation to Certain Problems. 
In the determination of stresses in thin plates by photo-elastic methods, 
it is sometimes desirable to transform an area bounded by circular arcs 
into a rectangular area which can be divided into a network of squares. 
The appropriate transformation is 

$$w = \ln z,$$

where $z = x + iy = r(\cos\theta + i\sin\theta) = re^{i\theta}$ and $w = u + iv$. On replacing w and z by their values in terms of u, v, r, θ, we have

$$u + iv = \ln re^{i\theta} = \ln r + i\theta.$$

Hence, on equating real and imaginary parts,

$$u = \ln r, \qquad v = \theta. \tag{1}$$

In order to transform the area bounded by two concentric circles and any two radii, denote the inner and outer radii by $r_1$ and $r_2$, respectively, and take one boundary as the line θ = 0 (see Fig. 25). Then from the first of equations (1) we have

$$u_1 = \ln r_1, \qquad u_2 = \ln r_2.$$

The width of the transformed rectangle is $u_2 - u_1$ (see Fig. 26); and if the rectangular area is to be divided into squares of side h, the side of one of these squares is

$$h = \frac{u_2 - u_1}{n} = \frac{\ln r_2 - \ln r_1}{n}, \tag{2}$$

where n denotes the number of subdivisions of $u_2 - u_1$.







To find the subdivision points in AB corresponding to those of $u_2 - u_1$, we write the equation $u_1 = \ln r_1$ in the form $r_1 = e^{u_1}$, give $u_1$ an increment $\Delta u$, and compute the corresponding increment in r. Thus,

$$r_1 = e^{u_1},$$
$$r_1 + \Delta r = e^{u_1 + \Delta u},$$
$$\Delta r = e^{u_1 + \Delta u} - e^{u_1} = e^{u_1}(e^{\Delta u} - 1),$$

or

$$\Delta r = r_1(e^{\Delta u} - 1). \tag{3}$$

On putting $\Delta u = h, 2h, \ldots, (n-1)h$ we get the subdivision points along AB. Note that $\Delta r$ is measured from the point A on the circle $r = r_1$.

To find the subdivisions of the angle from θ = 0 to θ = θ₁, we consider the second of Equations (1), from which

$$\Delta v = \Delta\theta.$$

Since the area in Fig. 26 is divided into squares of side h we must have $\Delta v = h$. Hence

$$\Delta\theta = h = \frac{\ln r_2 - \ln r_1}{n}, \tag{4}$$

where $\Delta\theta$ is in radians.

Having decided on the size of the squares in the transformed area A'B'C'D' in Fig. 26, one can compute the corresponding mesh points of Fig. 25 by means of Equations (2), (3), and (4). Note that the boundary ABCD of Fig. 25 is transformed into the boundary A'B'C'D' of Fig. 26, that a point P of Fig. 25 goes into P' of Fig. 26, etc. Note also
that the transformation gives values of the function closer together on 
the concave side of the given curved area than on the convex side — a 
desirable circumstance in stress problems. 

Example. If $r_1 = 2.2$, $r_2 = 3.8$, n = 4, find the subdivision intervals for r and θ.

Solution. From (2) we have

$$h = \frac{\ln 3.8 - \ln 2.2}{4} = \frac{1.3350 - 0.7885}{4} = 0.1366.$$

Hence $\Delta\theta = 0.1366$ radian = 7° 50′.

$$(\Delta r)_1 = 2.2(e^{0.1366} - 1) = 0.322$$
$$(\Delta r)_2 = 2.2(e^{0.2732} - 1) = 0.691$$
$$(\Delta r)_3 = 2.2(e^{0.4098} - 1) = 1.114.$$

These Δr's are to be measured outward from the point where r = 2.2 on the line θ = 0.
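The example can be reproduced with the sketch below, which applies equations (2), (3), and (4) directly. The function name `annulus_subdivision` is hypothetical. Because the script carries h to full precision instead of rounding it to 0.1366, the last figure of a result may differ by a unit from the hand computation.

```python
import math

def annulus_subdivision(r1, r2, n):
    """Return (h, dr) for the transformation w = ln z.

    h  : side of the squares in the transformed rectangle, equation (2);
         by equation (4) it is also the angular step in radians.
    dr : radial offsets from r = r1 for du = h, 2h, ..., (n - 1)h, equation (3).
    """
    h = (math.log(r2) - math.log(r1)) / n
    dr = [r1 * math.expm1(k * h) for k in range(1, n)]   # expm1(x) = e^x - 1
    return h, dr

h, dr = annulus_subdivision(2.2, 3.8, 4)
print(round(h, 4), round(math.degrees(h), 2), [round(d, 3) for d in dr])
```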





III. THE METHOD OF RELAXATION.

128. Solution of Difference Equations by Relaxation. In solving the Laplace difference equation by iteration, we employed the relation

$$u_0 = \tfrac{1}{4}(u_1 + u_2 + u_3 + u_4),$$

or

$$u_1 + u_2 + u_3 + u_4 - 4u_0 = 0,$$

until the equation was satisfied at every interior lattice point of the network. Except in the final stages of the process this equation is only approximately satisfied, the approximation becoming closer as the iteration process continues. Let $Q_0$ denote the residual, or discrepancy, at the lattice point $u_0$, so that

$$Q_0 = u_1 + u_2 + u_3 + u_4 - 4u_0. \tag{128.1}$$


A similar residual equation holds for any other interior lattice point. 

To solve the Laplace difference equation by the method of relaxation we divide the region into a network of squares, write down the known values of the function at the net points on the boundary, and then compute, estimate, or assign values for the function at all interior net points, just as was done in Example 1 of Art. 125. The next step is to compute the residuals at all interior net points by means of equation (128.1). The object of the relaxation process is to reduce all residuals to zero, as nearly as possible, by continued alteration ("relaxation") of the values of the function at the interior lattice points.

But when the value of the function u is changed at a lattice point, the 
values of the residuals at the adjacent interior points must be changed by 
exactly the same amount. Furthermore, the residual at the given point 
must be changed by — 4 times the change in the function at that point. 
These facts will become clear from a consideration of equation (128.1) 
and an appropriate figure. 

Let Fig. 27 represent a portion of a lattice network. Consider the point C3. The residual at this point is

$$Q_m = n + h + l + r - 4m. \tag{2}$$

If m is altered by an amount Δm, $Q_m$ is necessarily altered by some amount $\Delta Q_m$. Hence

$$Q_m + \Delta Q_m = n + h + l + r - 4(m + \Delta m).$$

Subtracting (2) from this equation, we get

$$\Delta Q_m = -4\,\Delta m. \tag{128.2}$$



Art. 128 ] 


SOLUTION BY RELAXATION 


411 


The change in the residual at C3 is thus −4 times the change in the function at that point.

Let us now see what happens to the residual at C2 when m is changed. The residual at C2 is given by

$$Q_l = m + g + k + q - 4l. \tag{4}$$

A change in m necessarily changes $Q_l$ according to the relation

$$Q_l + \Delta Q_l = m + \Delta m + g + k + q - 4l.$$


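Fig. 27. Portion of a lattice network, with rows A to E, columns 1 to 5, and the values of the function at the lattice points denoted by letters:

        1   2   3   4   5
    A   a   b   c   d   e
    B   f   g   h   i   j
    C   k   l   m   n   o
    D   p   q   r   s   t
    E   u   v   w   x   y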

Subtracting (4) from this equation, we get

$$\Delta Q_l = \Delta m. \tag{128.3}$$

It is thus apparent that when any functional value is altered (relaxed), the residuals of the adjacent interior points must be changed by an equal amount. The relations (128.2) and (128.3) must be strictly observed every time a functional value is changed. When the residuals are changed as required by (128.2) and (128.3), their resultant values will always be the same as those computed by formula (128.1).

Arithmetical mistakes are exceedingly apt to occur in working a problem 
by the relaxation process, due mainly to the fact that the computer will 
forget to correct some of the residuals at the adjacent points or else will 
make mistakes in combining the new alterations with the previous residuals. 
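The bookkeeping that causes these mistakes by hand is easy to express in code. The sketch below is an illustration under stated assumptions, not Southwell's working layout: the grid is a list of rows whose outer ring holds boundary values, the names `residual`, `relax`, and `solve_by_relaxation` are hypothetical, and the point with the largest residual is always relaxed by one-fourth of that residual.

```python
def residual(u, i, j):
    """Residual (128.1) at the interior point (i, j)."""
    return u[i - 1][j] + u[i + 1][j] + u[i][j - 1] + u[i][j + 1] - 4.0 * u[i][j]

def relax(u, q, i, j, delta):
    """Change u[i][j] by delta and correct the residual table q."""
    u[i][j] += delta
    q[i][j] -= 4.0 * delta                                   # relation (128.2)
    for a, b in ((i - 1, j), (i + 1, j), (i, j - 1), (i, j + 1)):
        if 0 < a < len(u) - 1 and 0 < b < len(u[0]) - 1:     # interior neighbors only
            q[a][b] += delta                                 # relation (128.3)

def solve_by_relaxation(u, tol=1e-6, max_steps=100_000):
    """Relax the largest residual until every residual is below tol in magnitude."""
    rows, cols = len(u), len(u[0])
    interior = [(i, j) for i in range(1, rows - 1) for j in range(1, cols - 1)]
    q = [[0.0] * cols for _ in range(rows)]
    for i, j in interior:
        q[i][j] = residual(u, i, j)
    for _ in range(max_steps):
        i, j = max(interior, key=lambda p: abs(q[p[0]][p[1]]))
        if abs(q[i][j]) < tol:
            break
        relax(u, q, i, j, q[i][j] / 4.0)    # "liquidates" the residual at (i, j)
    return q

# Assumed test problem: boundary values of u = x + y on a 5 x 5 lattice.
net = [[float(i + j) if i in (0, 4) or j in (0, 4) else 0.0 for j in range(5)]
       for i in range(5)]
final_q = solve_by_relaxation(net)
check = max(abs(final_q[i][j] - residual(net, i, j))
            for i in range(1, 4) for j in range(1, 4))
print(net[2][2], check)
```

The final comparison of the running residual table with residuals recomputed from (128.1) is the same check the text requires before a relaxation computation is considered finished.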




Hence the computer must be extremely careful to make all required 
corrections arising from a given point before he goes on to the next point. 

We shall explain the relaxation method further by applying it to the 
two examples worked in Art. 125. 



Fig. 28


Example 1. Solve Example 2 of Art. 125 by the method of relaxation. 

Solution. We take from Fig. 22 the boundary values and the approxi- 
mate values of the function at the interior lattice points, as shown in 
Fig. 28. Then we compute the residuals for all interior points by formula 
(128.1). The relaxation process may be started at any point, but it is 





advisable and customary to begin near the center of the lattice area and at the point having the largest residual. Then proceed to the point having the next largest residual, and so on. Furthermore, in order to "liquidate" a residual at any point, the increment of the function at that point must have the same algebraic sign as the residual and must be one-fourth as large. Hence to find the required magnitude of the relaxation at any point we divide the residual at that point by 4.

The largest residual in Fig. 28 is at the point C3. Hence we relax u at that point by one-fourth of the residual there, or 0.274. Then according to equations (128.2) and (128.3) we must add −4 × 0.274 to the residual at C3 and add 0.274 to the residuals at the four adjacent lattice points as indicated in the figure, the results of these additions being recorded as the new residuals at the affected points.

The next largest residuals are at B2 and B4. Hence we relax at these points by the amount −0.164 and correct all affected residuals according to equations (128.2) and (128.3). Note that we record the new value of the function at the relaxed point as soon as the relaxation is made.

The greatest remaining residuals are at D2 and D4. So we relax at these points by −0.110. Then we relax at B3 by the amount −0.014 and at D3 by 0.014. D2 and D4 are next relaxed by 0.004, and then B2 and B4 are relaxed by −0.003. Then relax D3 by 0.002 and B3 by −0.001. Now relax B2 and B4 by −0.001. Finally, relax B3 by 0.001. No further improvement is possible without decreasing the size of the mesh squares. It will be noted that the final residuals satisfy equation (128.1), and that the final values of the function at the interior mesh points are the same (with one exception) as those found by the iteration process in Fig. 22.
No computation by the relaxation method is finished until the final residuals 
are checked by (128.1) and found to satisfy that equation. 

The reader who is studying the relaxation method for the first time 
should work the above problem for himself by carrying out the compu- 
tation as outlined above. 
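The procedure of Example 1 can be sketched in a few lines of Python. This is only an illustrative sketch, not the hand schedule of Fig. 28: it assumes that formula (128.1) is the usual square-mesh residual (sum of the four neighbors minus four times the central value), it uses the exact increment Q/4 instead of rounding to three decimals, and the names `residuals` and `relax` are invented for the illustration.

```python
import numpy as np

def residuals(u):
    """Residual Q = (sum of the four neighbours) - 4*u at each interior point."""
    q = np.zeros_like(u, dtype=float)
    q[1:-1, 1:-1] = (u[:-2, 1:-1] + u[2:, 1:-1] +
                     u[1:-1, :-2] + u[1:-1, 2:] - 4.0 * u[1:-1, 1:-1])
    return q

def relax(u, tol=1e-3, max_steps=10_000):
    """Point relaxation: always liquidate the numerically largest residual."""
    u = u.astype(float).copy()
    q = residuals(u)
    rows, cols = u.shape
    for _ in range(max_steps):
        i, j = np.unravel_index(np.argmax(np.abs(q)), q.shape)
        if abs(q[i, j]) <= tol:
            break
        du = q[i, j] / 4.0            # increment: same sign as Q, one-fourth as large
        u[i, j] += du                 # record the new functional value at once
        q[i, j] -= 4.0 * du           # change in the residual at the relaxed point
        for a, b in ((i - 1, j), (i + 1, j), (i, j - 1), (i, j + 1)):
            if 0 < a < rows - 1 and 0 < b < cols - 1:
                q[a, b] += du         # change in the residuals at interior neighbours
    return u, q

# Hypothetical boundary values: u equals the column index on the boundary.
u0 = np.zeros((5, 5))
u0[0, :] = np.arange(5)
u0[-1, :] = np.arange(5)
u0[:, -1] = 4.0
u, q = relax(u0, tol=1e-9)
print(np.round(u, 3))
```

The running residuals `q` are updated incrementally, just as in the hand computation, so the final comparison of `q` with a fresh evaluation of `residuals(u)` plays the role of the closing check by (128.1).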

Example 2. Continue the solution of the above example by the relaxation
process when the value of h is halved.

Solution. We take from Fig. 23 the values of the function u on the
boundaries and at all interior mesh points, and enter them in Fig. 29 as
shown. Then we compute by formula (128.1) the residuals at the interior
points. The relaxation is started at B2 and B8 by relaxing at these points
by −0.041. The mesh-point values having the next largest residuals are
then relaxed and the process continued until no residual exceeds 0.002 in
magnitude. It is not possible to reduce all residuals to a smaller magnitude.
The final values of the function at the interior mesh points, and their
corresponding residuals, are recorded below the dotted lines in Fig. 29, the
corresponding initial values being written above the horizontal lines through
the mesh points. It will be seen that the final values agree within a unit
in the last digit with the values found by iteration in Fig. 23.

Fig. 29



















The trend of the functional values in lines D to U enabled us to speed 
up the convergence of the relaxation process to some extent, as it soon 
became evident that these functional values were continually increasing. 
Hence it was safe to overrelax in this area; that is, instead of changing 
the functional values by amounts just sufficient to liquidate their residuals, 
we change the functional values enough to produce residuals with changed 
signs and as large as they were before. The new residuals will soon be 
wiped out by increments added when adjacent points are relaxed. Such 
overrelaxation is always advisable when the residuals adjacent to a given
point have the same sign as the residual at that point.

In carrying out the computation for this problem some of the functional
values had to be relaxed 14 times, by amounts varying from 0.017 at first
down to 0.001 at the end. The number of relaxations could have been
decreased by drastic overrelaxation in the central region of the network.
But drastic overrelaxation should not be resorted to unless the computer
knows about what the functional values should be in the end.

129. Triangular Networks. Although the square network is the simplest
and the one most commonly used, a network of equilateral triangles
is sometimes more suitable for a particular problem. This is likely to be
the case in a region having an irregular or angular boundary. Fig. 30
represents a portion of a triangular network.

Fig. 30

In this case the fundamental relation which must be satisfied in the case
of Laplace's equation is

$$u_0 = \tfrac{1}{6}\,(u_1 + u_2 + u_3 + u_4 + u_5 + u_6), \tag{1}$$

and the residuals are given by

$$Q_0 = u_1 + u_2 + u_3 + u_4 + u_5 + u_6 - 6u_0 . \tag{2}$$




Formula (1) is derived on pages 21-33 of Southwell's book mentioned
above. Incidentally, Southwell points out, p. 24, that formula (1) is more
accurate than (124.1).

Formulas (1) and (2) are applied to triangular networks in exactly
the same manner as (124.1) and (128.1) are applied to networks of
square meshes. Formula (128.3) applies unchanged to triangular networks,
but (128.2) must be replaced by

$$\Delta Q_0 = -6\,\Delta u_0 . \tag{3}$$

Hence to liquidate a residual at any point, we must relax by one-sixth the
residual at that point and then add the increment to the residuals at the
six adjacent points.

130. Block Relaxation. Up to this point we have been altering or 
relaxing one functional value at a time. It sometimes saves time and labor 
to relax a whole group of functional values at a time. This procedure 
is advisable when the residuals at adjacent points in a region are nearly 
equal. In this case the functional values are relaxed as a block by changing 
them all by the same amount. We now consider the effect of such block 
relaxation on the residuals within the block and on those outside it. 

In Fig. 31 the group in the region surrounded by the heavy border are
to be relaxed as a block by relaxing all functional values in the block
(including those on the border) by an amount ε. It is clearly evident
that the residuals at interior points such as P are not altered by block
relaxation; for although the residual at such a point is immediately
changed by −4ε, each of the four adjacent points a, b, c, d contributes a
quantity ε to this residual and thus leaves it unchanged by the block
relaxation.

Such is not the case at points on the border. The residual at each of
these is immediately changed by −4ε, but the compensating contributions
from adjacent points depend on how many of the adjacent points are in
the relaxed block. The residual at point M, for example, is not changed,
because the four adjacent points are all in the relaxed block. The same is
true at Q and S. At the point N, three of the adjacent points are in the
block and one outside it; so the residual at N is changed by −ε. At R,
two of the adjacent points are in the block and two outside it. Hence at
R the residual is changed by −2ε.

The amount of change in the residual at any border point on a square
network is evidently −nε, where n denotes the number of adjacent points
which lie outside the relaxed block.
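The counting rule for a square network can be checked numerically. The sketch below is an illustration under stated assumptions: the residual is taken to be the square-mesh expression (sum of four neighbors minus four times the central value), the block is given as a Boolean mask containing interior points only, and the names `residuals` and `block_relax` are hypothetical.

```python
import numpy as np

def residuals(u):
    """Residual Q = (sum of the four neighbours) - 4*u at each interior point."""
    q = np.zeros_like(u, dtype=float)
    q[1:-1, 1:-1] = (u[:-2, 1:-1] + u[2:, 1:-1] +
                     u[1:-1, :-2] + u[1:-1, 2:] - 4.0 * u[1:-1, 1:-1])
    return q

def block_relax(u, q, block, eps):
    """Add eps to every functional value in the block and correct the residuals
    by counting neighbours, without recomputing them from scratch."""
    u = u.astype(float).copy()
    q = q.astype(float).copy()
    rows, cols = u.shape
    u[block] += eps
    for i in range(1, rows - 1):
        for j in range(1, cols - 1):
            nbrs = ((i - 1, j), (i + 1, j), (i, j - 1), (i, j + 1))
            inside = sum(int(block[a, b]) for a, b in nbrs)
            if block[i, j]:
                q[i, j] -= (4 - inside) * eps   # -n*eps, n = neighbours outside the block
            else:
                q[i, j] += inside * eps         # +eps for each connection to the block
    return u, q

rng = np.random.default_rng(0)
u = rng.random((7, 7))                 # arbitrary trial values, boundary included
q = residuals(u)
block = np.zeros((7, 7), dtype=bool)
block[2:5, 2:5] = True                 # a 3-by-3 block of interior points
u2, q2 = block_relax(u, q, block, 0.5)
print(np.allclose(q2, residuals(u2)))
```

The residual at the center of the block is untouched, while the corner points of the block, each with two outside neighbors, change by −2ε.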



Fig. 31. Residuals at border points changed by −nε, where n = number of adjacent points outside the relaxed block.

Similar considerations apply to a triangular network. At all interior
points such as P in Fig. 32 the residuals are not changed, whereas at all
border points the residuals are changed by the amount −nε, where n
denotes the number of adjacent points lying outside the relaxed area.
At B, for example, the residual is changed by −ε, whereas at A the
change is −4ε.

When the functional values in a block are relaxed by an amount ε, the
residuals of all adjacent points just outside the block must also be changed
by an amount ε as required by formula (128.3).




In block relaxation all functional values in the block are changed by 
the same amount, but no residuals are changed except those at the points 
on the border. This procedure may thereby save much time, and it also 
reduces the possibility of arithmetical errors. 



Fig. 32. Residuals at border points changed by −nε, where n = number of adjacent points outside the relaxed block.


The amount by which all functional values in a block are relaxed may
be anything desired. Southwell relaxes a block of values in accordance
with the formula

$$\epsilon = \frac{\Sigma Q}{m},$$

where ΣQ denotes the algebraic sum of the residuals in the block and m
denotes the number of adjacent points just outside the block. (The outside
adjacent points should be thought of as outside connections to the block
points. Sometimes an outside point is connected to two block points and
is then to be counted as two adjacent points outside the block. See Fig. 33.)

Example. Let it be required to relax as a block the functional values
within the bordered area of Fig. 33, only the residuals being shown.











Fig. 33


Here

$$\Sigma Q = 31 + 27 + 18 + \cdots = 581$$

and

$$m = 22 .$$

Hence

$$\epsilon = \frac{581}{22} = 26.4 .$$

Using round numbers, we relax all functional values of the block by 26
and change the border residuals by −26n, the results being indicated in
Fig. 33. Note that the residuals of the outside points adjacent to the
border have been changed as required by formula (128.3).
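The block increment can be computed mechanically once the block is marked. The sketch below is illustrative only: the function name `block_increment` and the small residual array are invented for the example, the block is assumed not to touch the edge of the array, and m is counted as the number of outside connections, so that an outside point joined to two block points is counted twice.

```python
import numpy as np

def block_increment(q, block):
    """Return (eps, m) with eps = (sum of block residuals) / m, where m is the
    number of connections from block points to points outside the block."""
    m = 0
    for i, j in zip(*np.nonzero(block)):
        for a, b in ((i - 1, j), (i + 1, j), (i, j - 1), (i, j + 1)):
            if not block[a, b]:
                m += 1
    return q[block].sum() / m, m

# Hypothetical residuals on a 4-by-4 array with a 2-by-2 block in the middle.
q = np.zeros((4, 4))
q[1:3, 1:3] = [[10.0, 20.0], [30.0, 20.0]]
block = np.zeros((4, 4), dtype=bool)
block[1:3, 1:3] = True
eps, m = block_increment(q, block)
print(eps, m)

# The figures quoted in the text for Fig. 33.
print(round(581 / 22, 1))
```

Since each block residual changes by −nε and the values of n add up to m, an increment chosen in this way brings the algebraic sum of the residuals in the block to zero.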

One may also relax a block within a larger block that is to be relaxed.
In that case the inner block should be relaxed first and the residuals at
outside points adjacent to its border changed in accordance with formula
(128.3). Then the larger block (including the relaxed inner one) can
be relaxed as desired.

After a group of values has been relaxed as a block, the whole network
(both inside and outside the block) may be relaxed point by point in any
manner until all residuals have been liquidated.

131. The Iteration and Relaxation Methods Compared. The method
of iteration and the method of relaxation are both methods for solving
partial difference equations with given boundary values. Although they
reach the desired solution by different processes, both methods are of the
same inherent accuracy. Their points of similarity and dissimilarity are
listed below.

1. Both methods require that the bounded region be divided into a network
of squares or other similar polygons.

2. Both methods require that the boundary values be written down and 
that rough values of the function be computed, estimated, or assumed for 
all interior points of the network. 

3. In order to start a computation, the iteration method assumes that
a functional value at any mesh point satisfies the given difference equation
(Laplace's, Poisson's, etc.) and thereby derives the relation which must
exist between that functional value and the adjacent functional values.
The process of iteration is then applied until the required relation is
satisfied.

4. The method of relaxation, on the other hand, recognizes at the start
that an assumed functional value at any mesh point will not satisfy the
given difference equation, but that there will be a residual at that point.
The residuals are computed for all points before the relaxation process
is started.

5. The method of iteration starts with the upper left-hand corner of the
network and proceeds to correct all network values by means of formula
(124.1) (in the case of Laplace's equation), using the latest computed
values available. The process is carried out in a systematic and definite
order by going from left to right until the end of a line is reached and
then dropping down to the next line, just as in reading the consecutive
lines of a printed page. This method of correcting the netpoint values
is continued until no further improvements can be effected by the iteration
process. The iteration process can be performed mechanically by an
automatic sequence-controlled calculating machine.

6. The method of relaxation requires that the residuals at every interior 
netpoint be computed by formula (128.1). Then these residuals are 
liquidated or reduced to zero (or nearly so) as quickly as possible by 
altering (relaxing) the netpoint values to any extent that seems advisable, 
always observing that the increment of the function must be of the same 
sign as the residual at that point and being careful to correct all affected 
residuals in accordance with formulas (128.2) and (128.3). The revised
values of the function should also be recorded at the time of alteration.
The relaxation process may start at any interior netpoint and jump around
all over the bounded region, usually beginning with the numerically largest
residuals and then proceeding to the next largest wherever they may be
found. Because of the perfectly arbitrary manner in which the relaxations
are made, the relaxation process cannot be carried out by an automatic
calculating machine. It is an individual, hand method just as the slide
rule is a hand device.

7. The iteration process is slow, sure, and frequently long. The relaxation
process is more rapid, less certain, and usually reasonably short. The
convergence is rapid by both methods at first, but becomes slow with both
methods long before the end is reached.

8. The arithmetic operations are easier and shorter with the method of
relaxation. The mental effort necessary to avoid mistakes, however, is
much greater than with the iteration method.

9. The greatest drawback to the method of iteration is its length; the
greatest drawback to the method of relaxation is its liability to errors of
computation. Such errors can be kept out only by extreme care and
unceasing vigilance on the part of the computer.

10. Computational errors in the method of iteration are immediately
evident and are self-correcting. In the method of relaxation any errors
in the functional values remain hidden and can be brought to light only
by application of formula (128.1). For this reason, all interior netpoint
values should be checked by (128.1) several times during a long computation.
Such checking takes time and keeps the relaxation process from
being as short as it might at first appear.

11. In the iteration process, attention is always fixed on the functional
values at the lattice points; in the relaxation process attention is always
centered on the residuals at those points.
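For comparison with the relaxation procedure, the sweep described in item 5 can be sketched as follows. The sketch assumes that formula (124.1) is the ordinary four-point average for Laplace's equation on a square mesh; the function name `iterate_sweeps`, the stopping tolerance, and the boundary values are hypothetical choices made for the illustration.

```python
import numpy as np

def iterate_sweeps(u, tol=1e-6, max_sweeps=10_000):
    """Sweep the net in reading order (left to right, top to bottom), replacing
    each interior value by the mean of its four neighbours and always using
    the latest computed values. Returns the values and the number of sweeps."""
    u = u.astype(float).copy()
    rows, cols = u.shape
    for sweep in range(1, max_sweeps + 1):
        biggest = 0.0
        for i in range(1, rows - 1):
            for j in range(1, cols - 1):
                new = 0.25 * (u[i - 1, j] + u[i + 1, j] + u[i, j - 1] + u[i, j + 1])
                biggest = max(biggest, abs(new - u[i, j]))
                u[i, j] = new
        if biggest <= tol:            # no further improvement at this tolerance
            return u, sweep
    return u, max_sweeps

# Hypothetical boundary values: u equals the column index on the boundary.
u0 = np.zeros((5, 5))
u0[0, :] = np.arange(5)
u0[-1, :] = np.arange(5)
u0[:, -1] = 4.0
u, sweeps = iterate_sweeps(u0, tol=1e-12)
print(sweeps)
print(np.round(u, 3))
```

Here attention is fixed on the functional values alone; no residuals are stored, and an arithmetical slip in one sweep is simply corrected by the sweeps that follow.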

When a computer discovers that a mistake has occurred somewhere in 
the relaxation solution, he should not spend much time in looking for its 
origin. Instead, he should compute all residuals by formula (128.1) and 
continue the solution with the new residuals. 

The reader should solve a problem of moderate length by both iteration 
and relaxation. Then he can decide for himself which method is pre- 
ferable in his case. 

Further information concerning short cuts, etc. in the iteration method
can be found in the papers by Shortley and Weller; and additional information
concerning the method of relaxation can be found in a valuable paper
by Howard W. Emmons, entitled "The Numerical Solution of Partial
Differential Equations" (Quarterly of Applied Mathematics, Vol. II, No. 3,
Oct., 1944), and in R. V. Southwell's Relaxation Methods in
Theoretical Physics, 1946.

IV. THE RAYLEIGH-RITZ METHOD. 

132. Introduction. The Rayleigh-Ritz method of solving boundary-value
problems is entirely different from either of the two methods considered
in the preceding pages. It is not based on difference equations
and does not employ them. In finding the solution of a physical problem
by this method, one assumes that the solution can be represented by a
linear combination of simple and easily calculated functions each of which
satisfies the given boundary conditions. After the problem has been
formulated as the definite integral of the algebraic sum of two or more
homogeneous, positive, and definite quadratic forms, or as the quotient of
two such integrals, the desired unknown function is replaced in the
integrals by the assumed linear combination. Then the integral, or the
quotient of the integrals, is minimized with respect to each of the arbitrary
constants occurring in the linear combination.

This method is direct and short if only approximate results are desired;
but if results of high accuracy are required, the method is quite laborious
and the labor cannot be appreciably lessened by mechanical aids. The
labor involved is mostly in long and tedious algebraic manipulations.

A special and simple form of the Rayleigh-Ritz method was first used
by Lord Rayleigh * (J. W. Strutt) for finding the fundamental vibration
period of an elastic body. It was later extended, generalized, and its
convergence proved by W. Ritz.* We shall attempt to explain it to some
extent by applying it to two examples.


133. The Vibrating String. Consider a tightly stretched elastic string
or wire of length l and fixed at the ends, and assume that it vibrates in a
vacuum. Let P denote the tension in the string and let ρ denote the
mass of unit length of the string. With coordinate axes as shown in
Fig. 34, let y denote the displacement of any point along the x-axis.
Then y is evidently a function of both the distance x and the time t. The
differential equation for the motion of a point of the string is thus a
partial differential equation and is easily shown to be *

$$\frac{\partial^2 y}{\partial t^2} = \frac{P}{\rho}\,\frac{\partial^2 y}{\partial x^2}, \tag{1}$$

with boundary conditions y(0) = 0, y(l) = 0.

If we also impose the initial condition that ∂y/∂t = 0 when t = 0, we find
by the method of separation of variables that the solution of (1) is

$$y = C \sin\frac{n\pi x}{l}\,\cos\!\left(\frac{n\pi}{l}\sqrt{\frac{P}{\rho}}\;t\right), \qquad n = 1, 2, 3, \cdots. \tag{2}$$

The vibration frequency is therefore

$$f = \frac{n}{2l}\sqrt{\frac{P}{\rho}}. \tag{3}$$

For n = 1 we get the natural or fundamental frequency. Hence for this
frequency we have

$$f_1 = \frac{1}{2l}\sqrt{\frac{P}{\rho}}. \tag{4}$$

The higher frequencies or overtones are found from (3) by putting
n = 2, 3, ···, etc.

To find the vibration frequencies of the cord by the Rayleigh-Ritz method, 
we neither set up a partial differential equation nor solve one. On the 
contrary, we assume that the cord is vibrating in a vacuum and utilize 
the fact that under this condition the total energy of the vibrating cord 
remains constant. Hence the maximum kinetic energy must be equal to 
the maximum potential energy. 

The potential energy at any instant is the stored-up elastic energy due 
to stretching. It is equal to the work done in stretching the string to its 
longer form in the bowed position. Since the total energy of the string 
is constant, the potential energy at the end of a swing when the string 
comes momentarily to rest is equal to the kinetic energy when the string 
coincides with the x-axis and is at that instant unstretched. 

Because of the elasticity of the material of the string, the deflection y 
at any point is, by Hooke’s law, proportional to the force in the y-direction. 
Hence the motion is necessarily harmonic and can be represented by the 
equation 

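$$y = X \sin \omega t, \tag{5}$$

where X is a function of x alone and ω is a constant.

From Fig. 34 it is evident that the increase in length of an initially
unstretched segment dx is ds − dx, or

$$\left[\sqrt{1 + \left(\frac{\partial y}{\partial x}\right)^{2}} - 1\right] dx,$$

and the work done in producing this stretch is

$$P\left[\sqrt{1 + \left(\frac{\partial y}{\partial x}\right)^{2}} - 1\right] dx .$$

Hence the work done in stretching the whole string is

$$U = P\int_0^l \left[\sqrt{1 + \left(\frac{\partial y}{\partial x}\right)^{2}} - 1\right] dx .$$

On expanding the radical into a binomial series and neglecting
(∂y/∂x)⁴, (∂y/∂x)⁶, etc., since the slope of the string is small, we have

$$U = \frac{P}{2}\int_0^l \left(\frac{\partial y}{\partial x}\right)^{2} dx .$$

Now replacing ∂y/∂x by its value found from (5), we have

$$U = \frac{P}{2}\left[\int_0^l \left(\frac{dX}{dx}\right)^{2} dx\right]\sin^2 \omega t .$$

This is the potential energy of the string at any time. Its maximum value
is when sin² ωt = 1. Hence

$$U_{\max} = \frac{P}{2}\int_0^l \left(\frac{dX}{dx}\right)^{2} dx .$$

The kinetic energy of a segment dx of the vibrating string is ½ρ dx (∂y/∂t)²,
and therefore for the whole string it is

$$T = \frac{\rho}{2}\int_0^l \left(\frac{\partial y}{\partial t}\right)^{2} dx .$$

Replacing ∂y/∂t by its value from (5), we have

$$T = \frac{\rho\,\omega^2}{2}\left[\int_0^l X^2\,dx\right]\cos^2 \omega t .$$

This has its maximum value when cos² ωt = 1. Hence the maximum
kinetic energy is

$$T_{\max} = \frac{\rho\,\omega^2}{2}\int_0^l X^2\,dx .$$

Since U_max = T_max, we get

$$\omega^2 = \frac{P}{\rho}\;\frac{\displaystyle\int_0^l \left(\frac{dX}{dx}\right)^{2} dx}{\displaystyle\int_0^l X^2\,dx}. \tag{6}$$

The next step in solving this problem by the Ritz method is to choose
for X a simple function which satisfies the boundary conditions and
contains several parameters. These parameters are to be chosen or determined
so as to make the right member of (6) a minimum. A suitable
expression for X in the vibrating string problem is

$$X = x(l - x)(a_1 + a_2 x + a_3 x^2 + \cdots). \tag{7}$$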



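This is substituted for X in (6) and then the partial derivative of the
right member with respect to each parameter is placed equal to zero.
The result is a system of homogeneous linear equations in the unknown
parameters.

In order to reduce the labor of finding the partial derivatives of the
right member of (6), we differentiate it with respect to a typical parameter
and derive a formula from which the linear equations are easily obtained.
Bearing in mind that X, as expressed by (7), contains the independent
parameters a₁, a₂, ···, aₙ, we have by (6) and by using the rule for
differentiating a quotient,

$$\frac{\partial(\omega^2)}{\partial a_i} = \frac{P}{\rho}\;\frac{\displaystyle\int_0^l X^2 dx\;\frac{\partial}{\partial a_i}\int_0^l \left(\frac{dX}{dx}\right)^{2} dx \;-\; \int_0^l \left(\frac{dX}{dx}\right)^{2} dx\;\frac{\partial}{\partial a_i}\int_0^l X^2 dx}{\displaystyle\left[\int_0^l X^2 dx\right]^{2}} = 0,$$

or

$$\int_0^l X^2 dx\;\frac{\partial}{\partial a_i}\int_0^l \left(\frac{dX}{dx}\right)^{2} dx \;-\; \int_0^l \left(\frac{dX}{dx}\right)^{2} dx\;\frac{\partial}{\partial a_i}\int_0^l X^2 dx = 0 .$$

But, by (6),

$$\int_0^l \left(\frac{dX}{dx}\right)^{2} dx = \frac{\rho\,\omega^2}{P}\int_0^l X^2 dx .$$

Making this substitution in the second term of the equation above, we get

$$\int_0^l X^2 dx\;\frac{\partial}{\partial a_i}\int_0^l \left(\frac{dX}{dx}\right)^{2} dx \;-\; \frac{\rho\,\omega^2}{P}\int_0^l X^2 dx\;\frac{\partial}{\partial a_i}\int_0^l X^2 dx = 0 .$$

Taking out the common factor ∫₀ˡ X² dx, which is not zero, we have

$$\frac{\partial}{\partial a_i}\int_0^l \left(\frac{dX}{dx}\right)^{2} dx - \frac{\rho\,\omega^2}{P}\,\frac{\partial}{\partial a_i}\int_0^l X^2 dx = 0,$$

or

$$\frac{\partial}{\partial a_i}\int_0^l \left[\left(\frac{dX}{dx}\right)^{2} - kX^2\right] dx = 0, \tag{8}$$

where k = ρω²/P.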


In order to increase the rapidity of convergence of the Ritz process,
we shall move the origin of space coordinates to the midpoint of the
string. Then the boundary conditions will be y(−l/2) = y(l/2) = 0,
and the appropriate polynomial to meet these conditions will be

$$X = (c^2 - x^2)(a_1 + a_2 x^2 + a_3 x^4 + \cdots), \tag{9}$$

where c = l/2. It is to be noted that (9) gives only the modes of vibration
which are symmetric to the position of the y-axis.

Taking only the first term of (9) as a first approximation for X, we have

$$X = a_1(c^2 - x^2), \qquad \frac{dX}{dx} = -2a_1 x .$$

Then

$$\int_{-c}^{c} \left(\frac{dX}{dx}\right)^{2} dx = 4a_1^2\int_{-c}^{c} x^2\,dx = \frac{8a_1^2c^3}{3},$$

$$\int_{-c}^{c} X^2\,dx = a_1^2\int_{-c}^{c} (c^4 - 2c^2x^2 + x^4)\,dx = \frac{16}{15}\,a_1^2c^5 .$$

Substituting these in (8), we get

$$\frac{\partial}{\partial a_1}\left(\frac{8a_1^2c^3}{3} - \frac{16}{15}\,k\,a_1^2c^5\right) = \frac{16a_1c^3}{3} - \frac{32\,k\,a_1c^5}{15} = 0 .$$

Hence k = 5/(2c²) = 2.5/c². The exact value of k as previously found is
k = π²/l² = π²/(4c²) = 2.4674/c². The agreement is thus fairly close.

To get a better approximation, we take the first two terms of (9).

$$X = (c^2 - x^2)(a_1 + a_2x^2),$$

and

$$\frac{dX}{dx} = -2a_1x + 2a_2c^2x - 4a_2x^3 .$$

Hence

$$\int_{-c}^{c} \left(\frac{dX}{dx}\right)^{2} dx = \frac{8a_1^2c^3}{3} + \frac{16}{15}\,a_1a_2c^5 + \frac{88}{105}\,a_2^2c^7,$$

and

$$\int_{-c}^{c} X^2\,dx = \frac{16}{15}\,a_1^2c^5 + \frac{32}{105}\,a_1a_2c^7 + \frac{16}{315}\,a_2^2c^9 .$$

Substituting these in (8), taking the partial derivatives with respect to
a₁ and a₂ in turn, and then reducing slightly, we get

$$\left(1 - \frac{2kc^2}{5}\right)a_1 + \frac{c^2}{5}\left(1 - \frac{2kc^2}{7}\right)a_2 = 0,$$

$$\left(1 - \frac{2kc^2}{7}\right)a_1 + \frac{c^2}{7}\left(11 - \frac{2kc^2}{3}\right)a_2 = 0 .$$

These two homogeneous equations will have a common non-trivial solution
only if the determinant of their coefficients is zero, or

$$\begin{vmatrix} 1 - \dfrac{2kc^2}{5} & \dfrac{c^2}{5}\left(1 - \dfrac{2kc^2}{7}\right) \\[2ex] 1 - \dfrac{2kc^2}{7} & \dfrac{c^2}{7}\left(11 - \dfrac{2kc^2}{3}\right) \end{vmatrix} = 0,$$

from which we get

$$c^4k^2 - 28c^2k + 63 = 0 .$$

Solving this equation for k, we find

$$k = \frac{25.53256}{c^2}, \qquad k = \frac{2.46744}{c^2} .$$

In the exact case we found

$$k = \frac{\rho\,\omega^2}{P} = \frac{n^2\pi^2}{l^2} = \frac{n^2\pi^2}{4c^2} .$$

Hence

for n = 1, k = π²/(4c²) = 9.869604/(4c²) = 2.467401/c²;

for n = 3, k = 9π²/(4c²) = 22.20661/c².

On comparing the above Ritz values of k with these exact values, we see
that for the fundamental mode the Ritz value agrees with the exact value
to five significant figures.

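Values of still greater accuracy can be found by taking the first three
terms of the second parenthesis of (9):

$$X = (c^2 - x^2)(a_1 + a_2x^2 + a_3x^4).$$

On evaluating ∫(dX/dx)² dx and ∫X² dx, differentiating them with
respect to a₁, a₂, a₃ in turn, and substituting the results in (8), we get

$$\left(1 - \frac{2kc^2}{5}\right)a_1 + \frac{c^2}{5}\left(1 - \frac{2kc^2}{7}\right)a_2 + \frac{3c^4}{35}\left(1 - \frac{2kc^2}{9}\right)a_3 = 0,$$

$$\left(1 - \frac{2kc^2}{7}\right)a_1 + \frac{c^2}{7}\left(11 - \frac{2kc^2}{3}\right)a_2 + c^4\left(1 - \frac{10kc^2}{231}\right)a_3 = 0,$$

$$\frac{1}{35}\left(1 - \frac{2kc^2}{9}\right)a_1 + \frac{c^2}{3}\left(\frac{1}{5} - \frac{2kc^2}{231}\right)a_2 + \frac{c^4}{33}\left(\frac{13}{7} - \frac{2kc^2}{39}\right)a_3 = 0 .$$

These homogeneous equations will have a common, non-trivial solution if
and only if

$$\begin{vmatrix} 1 - \dfrac{2kc^2}{5} & \dfrac{c^2}{5}\left(1 - \dfrac{2kc^2}{7}\right) & \dfrac{3c^4}{35}\left(1 - \dfrac{2kc^2}{9}\right) \\[2ex] 1 - \dfrac{2kc^2}{7} & \dfrac{c^2}{7}\left(11 - \dfrac{2kc^2}{3}\right) & c^4\left(1 - \dfrac{10kc^2}{231}\right) \\[2ex] \dfrac{1}{35}\left(1 - \dfrac{2kc^2}{9}\right) & \dfrac{c^2}{3}\left(\dfrac{1}{5} - \dfrac{2kc^2}{231}\right) & \dfrac{c^4}{33}\left(\dfrac{13}{7} - \dfrac{2kc^2}{39}\right) \end{vmatrix} = 0 .$$

Putting 2kc² = λ, expanding the determinant, and simplifying, we get

$$\lambda^3 - 225\lambda^2 + 8910\lambda - 38610 = 0,$$

where all the coefficients are exact numbers (not rounded).

The smallest root of the above cubic equation is easily found by the
Newton-Raphson method, starting with the approximate value 4.93488
(or preferably 4.935) found in the previous calculation. The new value
is thus found to be

$$\lambda_1 = 4.934802217,$$

correct to the last digit given.

The other two roots are best found by taking the root λ₁ out of the given
equation, by synthetic division, and then solving the resulting quadratic
equation. The depressed equation is

$$\lambda^2 - 220.065197783\,\lambda + 7824.02177410 = 0,$$

the roots of which (found by the quadratic formula) are

$$\lambda_2 = 44.586811826, \qquad \lambda_3 = 175.478385957 .$$

As a check on these values it may be noted that

$$\lambda_1 + \lambda_2 + \lambda_3 = 225.000{,}000{,}000,$$

as required by theory.

We will now compare these Ritz values with the exact values. Since

$$\lambda = 2kc^2, \qquad k = \frac{\rho\,\omega^2}{P} = \frac{n^2\pi^2}{l^2} = \frac{n^2\pi^2}{4c^2},$$

we find

$$\lambda = \frac{n^2\pi^2}{2} .$$

Hence

$$\lambda_1 = \frac{\pi^2}{2} = \frac{9.869604401089}{2} = 4.934802200545,$$

$$\lambda_2 = \frac{9\pi^2}{2} = 44.4132198,$$

$$\lambda_3 = \frac{25\pi^2}{2} = 123.37 .$$

It will be noted that the Ritz value for λ₁ is correct to eight significant
figures, that the value for λ₂ is hardly correct to two figures, and that the
value of λ₃, although of the proper order of magnitude, is so inaccurate
as to be almost worthless.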

The reader will observe that all the Ritz values are larger than the 
corresponding exact values. This is usually the case. The reader should 
also note that more accurate Ritz values were found by taking additional 
terms in (9) and not by correcting previous values. As more terms of (9) 
are taken, the labor of computation increases enormously, so that more than 
three terms will involve an almost prohibitive amount of labor. 
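The algebra of this article can be checked numerically. The following sketch is not the hand procedure of the text: it assumes the trial functions (c² − x²)x^(2j) of (9), integrates the polynomials exactly with NumPy's `Polynomial` class, and solves the resulting matrix eigenvalue problem instead of expanding a determinant. The function name `ritz_lambdas` and the choice c = 1 are hypothetical.

```python
import numpy as np
from numpy.polynomial import Polynomial as Poly

def ritz_lambdas(n_terms, c=1.0):
    """Ritz values of lambda = 2*k*c**2 for the string, using the trial
    functions (c^2 - x^2) * x^(2j), j = 0, ..., n_terms - 1."""
    x = Poly([0.0, 1.0])
    basis = [(c**2 - x**2) * x**(2 * j) for j in range(n_terms)]

    def integral(p):
        # exact integral of a polynomial from -c to c
        antiderivative = p.integ()
        return antiderivative(c) - antiderivative(-c)

    # A[i][j] = integral of X_i' X_j',  B[i][j] = integral of X_i X_j
    A = np.array([[integral(f.deriv() * g.deriv()) for g in basis] for f in basis])
    B = np.array([[integral(f * g) for g in basis] for f in basis])
    # Condition (8) for every parameter gives (A - k B) a = 0.
    k = np.linalg.eigvals(np.linalg.solve(B, A))
    return np.sort(k.real) * 2.0 * c**2

for n in (1, 2, 3):
    print(n, ritz_lambdas(n))
print("exact:", [m**2 * np.pi**2 / 2 for m in (1, 3, 5)])
```

With three terms the smallest value should agree with the 4.934802217 found above, and the three values should add up to 225. Solving the matrix problem numerically also removes most of the algebraic labor that makes a fourth term impractical by hand.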


134. Vibration of a Rectangular Membrane. Consider a thin elastic
membrane of rectangular form with sides a and b (Fig. 35), such as a
very thin sheet of rubber, and assume that the membrane is made fast at
the edges while tightly stretched. Take a set of three mutually perpendicular
axes, with the xy-plane coinciding with the membrane and the z-axis
perpendicular to it. Then if an interior region of the membrane be pulled or
pushed in a direction at right angles to its plane of equilibrium (the
xy-plane), it becomes distorted into a curved surface, the area of which is

$$S = \int_0^a\!\!\int_0^b \sqrt{1 + \left(\frac{\partial z}{\partial x}\right)^{2} + \left(\frac{\partial z}{\partial y}\right)^{2}}\;dy\,dx = \int_0^a\!\!\int_0^b \left[1 + \frac{1}{2}\left(\frac{\partial z}{\partial x}\right)^{2} + \frac{1}{2}\left(\frac{\partial z}{\partial y}\right)^{2}\right] dy\,dx,$$

approximately, since the distortion is small. The increase in area of the
membrane due to the distortion is therefore

$$\Delta S = \frac{1}{2}\int_0^a\!\!\int_0^b \left[\left(\frac{\partial z}{\partial x}\right)^{2} + \left(\frac{\partial z}{\partial y}\right)^{2}\right] dy\,dx .$$

Let T denote the tension on a unit length of boundary of the membrane,
the direction of T being perpendicular to the edge of the boundary. Then
the work done in deflecting the membrane until its area is increased by
an amount ΔS is TΔS;* and the potential energy in the deflected position
is equal to the work done in producing the deflection. Since the deflection
is small, the tension T remains practically constant. Hence the potential
energy of the membrane in a deflected position is

$$\text{P.E.} = \frac{T}{2}\int_0^a\!\!\int_0^b \left[\left(\frac{\partial z}{\partial x}\right)^{2} + \left(\frac{\partial z}{\partial y}\right)^{2}\right] dy\,dx .$$

Because of the elasticity of the membrane, the deflection at any point is
proportional to the force applied, and the motion is thus simple harmonic.
Hence the deflection is a periodic function of the time, or

$$z = Z(x, y)\,\sin \omega t .$$


* Consider a rectangular region of dimensions u and v (Fig. 36). First let the
side AB be fixed and let the membrane be pulled to the right with a force of T
pounds per unit of width, or Tv for the whole side. The force Tv will
stretch the membrane an amount Δu and do Tv · Δu units of work in
doing so.

Now let the side BC be fixed and let the membrane be pulled in the
direction of the side AB by a force of T pounds per unit length of
border, or T(u + Δu) for the whole side. The force T(u + Δu) will
stretch the membrane by an amount Δv in that direction and do
T(u + Δu) · Δv units of work in doing so. Hence the total work done is

Tv Δu + T(u + Δu) Δv = T(v Δu + u Δv + Δu Δv)
= T times area of shaded border
= T times increase in area of membrane.

Fig. 36




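On substituting this value of z in the above expression for the potential
energy, we get

$$\text{P.E.} = \frac{T}{2}\left\{\int_0^a\!\!\int_0^b \left[\left(\frac{\partial Z}{\partial x}\right)^{2} + \left(\frac{\partial Z}{\partial y}\right)^{2}\right] dy\,dx\right\}\sin^2 \omega t .$$

The maximum value of this is

$$(\text{P.E.})_{\max} = \frac{T}{2}\int_0^a\!\!\int_0^b \left[\left(\frac{\partial Z}{\partial x}\right)^{2} + \left(\frac{\partial Z}{\partial y}\right)^{2}\right] dy\,dx .$$

The kinetic energy of an element dm = ρ dy dx of the membrane is

$$\tfrac{1}{2}\,\rho\,dy\,dx\left(\frac{\partial z}{\partial t}\right)^{2} = \tfrac{1}{2}\,\rho\,dy\,dx\cdot\omega^2 Z^2(x, y)\cos^2 \omega t,$$

where ρ denotes the mass of unit area of the membrane.

The kinetic energy of the entire vibrating membrane is therefore

$$\text{K.E.} = \frac{\rho\,\omega^2}{2}\left(\int_0^a\!\!\int_0^b Z^2\,dy\,dx\right)\cos^2 \omega t,$$

and the maximum value of this is

$$(\text{K.E.})_{\max} = \frac{\rho\,\omega^2}{2}\int_0^a\!\!\int_0^b Z^2\,dy\,dx .$$

Since there is assumed to be no loss of energy due to vibration, the maximum
potential energy is equal to the maximum kinetic energy and we
thus have

$$\frac{T}{2}\int_0^a\!\!\int_0^b \left[\left(\frac{\partial Z}{\partial x}\right)^{2} + \left(\frac{\partial Z}{\partial y}\right)^{2}\right] dy\,dx = \frac{\rho\,\omega^2}{2}\int_0^a\!\!\int_0^b Z^2\,dy\,dx,$$

or

$$\omega^2 = \frac{T}{\rho}\;\frac{\displaystyle\int_0^a\!\!\int_0^b \left[\left(\frac{\partial Z}{\partial x}\right)^{2} + \left(\frac{\partial Z}{\partial y}\right)^{2}\right] dy\,dx}{\displaystyle\int_0^a\!\!\int_0^b Z^2\,dy\,dx}. \tag{1}$$

We must now assume for Z a linear combination of simple functions
which will satisfy the boundary conditions of the problem. Such a function
is

$$Z = xy(a - x)(b - y)(a_1 + a_2x + a_3y + \cdots). \tag{2}$$

In order to make the convergence as rapid as possible, however, we move
the origin to the center of the rectangle. Then because of symmetry we
may write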

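$$Z = (p^2 - x^2)(q^2 - y^2)(a_1 + a_2x^2 + a_3y^2 + \cdots), \tag{3}$$

where p = a/2, q = b/2.

Assuming that Z in (1) has been replaced by (2) or (3) above, we
must determine the a's so as to make ω² a minimum. Hence the derivative
of the right member of (1) with respect to each of the a's must be zero.
Then by the rule for differentiating a quotient we have

$$\iint Z^2\,dy\,dx\;\frac{\partial}{\partial a_i}\iint \left[\left(\frac{\partial Z}{\partial x}\right)^{2} + \left(\frac{\partial Z}{\partial y}\right)^{2}\right] dy\,dx \;-\; \iint \left[\left(\frac{\partial Z}{\partial x}\right)^{2} + \left(\frac{\partial Z}{\partial y}\right)^{2}\right] dy\,dx\;\frac{\partial}{\partial a_i}\iint Z^2\,dy\,dx = 0 .$$

Replacing

$$\iint \left[\left(\frac{\partial Z}{\partial x}\right)^{2} + \left(\frac{\partial Z}{\partial y}\right)^{2}\right] dy\,dx$$

in the second term by its value

$$\frac{\rho\,\omega^2}{T}\iint Z^2\,dy\,dx$$

as found from (1), we have

$$\iint Z^2\,dy\,dx\;\frac{\partial}{\partial a_i}\iint \left[\left(\frac{\partial Z}{\partial x}\right)^{2} + \left(\frac{\partial Z}{\partial y}\right)^{2}\right] dy\,dx \;-\; \frac{\rho\,\omega^2}{T}\iint Z^2\,dy\,dx\;\frac{\partial}{\partial a_i}\iint Z^2\,dy\,dx = 0 .$$

Now taking out the common factor ∬ Z² dy dx, we get

$$\frac{\partial}{\partial a_i}\iint \left[\left(\frac{\partial Z}{\partial x}\right)^{2} + \left(\frac{\partial Z}{\partial y}\right)^{2}\right] dy\,dx - \frac{\rho\,\omega^2}{T}\,\frac{\partial}{\partial a_i}\iint Z^2\,dy\,dx = 0,$$

or

$$\frac{\partial}{\partial a_i}\iint \left[\left(\frac{\partial Z}{\partial x}\right)^{2} + \left(\frac{\partial Z}{\partial y}\right)^{2} - kZ^2\right] dy\,dx = 0, \tag{4}$$

where k = ρω²/T, i = 1, 2, ···, n. Formula (4) will give n homogeneous
equations for determining n values of k.

If the form (3) is used for Z, the limits of integration in (4) will be
from −p to p for x and −q to q for y.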

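To get a first approximation to the vibration frequency of the membrane,
we take only the first term of the parenthetic polynomial in (3). Then

$$Z = a_1(p^2 - x^2)(q^2 - y^2),$$

$$\frac{\partial Z}{\partial x} = -2a_1x(q^2 - y^2), \qquad \frac{\partial Z}{\partial y} = -2a_1y(p^2 - x^2).$$

Hence

$$\int_{-p}^{p}\!\!\int_{-q}^{q} \left[\left(\frac{\partial Z}{\partial x}\right)^{2} + \left(\frac{\partial Z}{\partial y}\right)^{2}\right] dy\,dx = 4a_1^2\int_{-p}^{p}\!\!\int_{-q}^{q} \left\{x^2(q^2 - y^2)^2 + y^2(p^2 - x^2)^2\right\} dy\,dx = \frac{128}{45}\,a_1^2\,p^3q^3(p^2 + q^2),$$

and

$$\int_{-p}^{p}\!\!\int_{-q}^{q} Z^2\,dy\,dx = a_1^2\int_{-p}^{p}\!\!\int_{-q}^{q} (p^2 - x^2)^2(q^2 - y^2)^2\,dy\,dx = \left(\frac{16}{15}\right)^{2} p^5q^5a_1^2 .$$

On substituting these in (4), we get

$$\frac{\partial}{\partial a_1}\left[\frac{128}{45}\,a_1^2\,p^3q^3(p^2 + q^2) - k\left(\frac{16}{15}\right)^{2} p^5q^5a_1^2\right] = 0,$$

or

$$\left[(p^2 + q^2) - \frac{2}{5}\,k\,p^2q^2\right] a_1 = 0,$$

from which

$$k = \frac{5}{2}\left(\frac{p^2 + q^2}{p^2q^2}\right) = \frac{5}{2}\left(\frac{1}{p^2} + \frac{1}{q^2}\right).$$

Replacing p by a/2 and q by b/2, we get

$$k = 10\left(\frac{1}{a^2} + \frac{1}{b^2}\right).$$

Since k = ρω²/T, we finally get

$$\omega = \sqrt{10}\,\sqrt{\frac{T}{\rho}}\,\sqrt{\frac{1}{a^2} + \frac{1}{b^2}} .$$

The frequency is therefore

$$f = \frac{\omega}{2\pi} = \frac{\sqrt{10}}{2\pi}\,\sqrt{\frac{T}{\rho}}\,\sqrt{\frac{1}{a^2} + \frac{1}{b^2}} .$$

This is the lowest or natural vibration frequency of the membrane.

The vibration frequencies found by the classical method of separating
the variables are given by the formula

$$f_{mn} = \frac{1}{2}\,\sqrt{\frac{T}{\rho}}\,\sqrt{\frac{m^2}{a^2} + \frac{n^2}{b^2}} .$$

For m = 1, n = 1, this formula becomes

$$f_{11} = \frac{1}{2}\,\sqrt{\frac{T}{\rho}}\,\sqrt{\frac{1}{a^2} + \frac{1}{b^2}} .$$

Since √10/2π = 0.5033, it is evident that the Ritz method gives a close
approximation to the exact value. A more accurate value could be obtained
by taking the first three terms of the parenthetical polynomial in (3), but
the increased accuracy would be obtained at considerable expense in time
and labor.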
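The comparison just made can be reproduced in a few lines. The sketch below simply evaluates the one-term Ritz value k = (5/2)(1/p² + 1/q²) and the exact value of k for the (m, n) mode, k = π²(m²/a² + n²/b²); the function names and the sample dimensions a = 3, b = 2 are hypothetical.

```python
import math

def ritz_k_one_term(a, b):
    """One-term Ritz value of k = rho*omega^2/T for an a-by-b membrane."""
    p, q = a / 2.0, b / 2.0
    return 2.5 * (1.0 / p**2 + 1.0 / q**2)

def exact_k(a, b, m=1, n=1):
    """Exact k for the (m, n) mode from separation of variables."""
    return math.pi**2 * (m**2 / a**2 + n**2 / b**2)

a, b = 3.0, 2.0                                   # sample side lengths
ratio = math.sqrt(ritz_k_one_term(a, b) / exact_k(a, b))
print(ratio)                                      # Ritz frequency / exact frequency
print(round(math.sqrt(10) / (2 * math.pi), 4))    # coefficient quoted in the text
```

The ratio of the Ritz frequency to the exact one equals √10/π whatever the side lengths, so the one-term approximation is high by well under one per cent for every rectangle.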

185. Comments on the Three Methods. Three numerical methods for 
solving boundary-value problems in two dimensions have been considered 
in the present chapter. Each method has its advantages and disadvantages. 
The iteration method is slow, self-correcting, and well adapted to use with 
an automatic sequence-controlled calculating machine. The arithmetical 
operations are short and simple. 

The relaxation method is faster and more flexible than the iteration
method. The arithmetical operations are simple, but mistakes are easy to 
make and are not self-correcting. It requires constant vigilance and alert- 
ness on the part of the computer. It is not adapted to use by an automatic 
calculating machine. 

The Rayleigh-Ritz method is of considerable value in handling problems 
of equilibrium and elastic vibrations. It does not require a partial dif- 
ferential equation to start with, but it does require that a physical problem 
be reduced to the definite integral of a sum, difference, or quotient of two 
or more homogeneous positive and definite quadratic forms. The method 
furnishes a short and easy way of finding a good approximation to the 
natural vibration period of an elastic body, deflection of a membrane, etc. 
The chief disadvantage of the method is the laborious algebra involved in 
getting results of high accuracy. 

It is an easy matter to estimate the accuracy of results obtained by the 
iteration and relaxation methods, but this is not the case with the
Rayleigh-Ritz method. No simple and useful formula for estimating the inherent
error involved in this method has yet been devised. 

A choice between the iteration and relaxation methods would depend 
upon the mechanical aids at the disposal of the computer. If an automatic 
sequence-controlled calculator is at hand, the iteration method would be 
used. If an automatic calculator is not at hand, the relaxation method will give
the desired solution in the shortest time and with the least work. 





Finally, it must be realized that not all three methods may be applicable
to a given problem. To use the iteration and relaxation methods, a physical
problem must first be set up as a partial differential equation and this must
then be converted to a partial difference equation. The Rayleigh-Ritz
method will give an approximate solution of a problem without setting up
a partial differential equation, as was done in the cases of the vibrating
string and vibrating membrane. In problems where all three methods are
applicable, the Rayleigh-Ritz method would probably be the third choice.

It is needless to say that all these methods are inferior to the classical 
method of separating the variables, but they will give approximate solu- 
tions to problems in which the variables cannot be separated. 



CHAPTER XV 


THE NUMERICAL SOLUTION OF INTEGRAL EQUATIONS 

136. Integral Equations — Definitions. An integral equation is a func- 
tional equation in which the unknown function occurs under the integral 
sign as well as outside it. The simplest type imaginable arises from the 
integration of the simple differential equation $dy/dx=f(x,y)$, with the
initial condition $y=y_0$ when $x=x_0$. The result is

$$y=\int f(x,y)\,dx+C=\int_{x_0}^{x}f(x,y)\,dx+y_0,$$

as stated on page 317. Two important types of integral equations are

$$\phi(x)=\int_a^b K(x,t)\,\phi(t)\,dt \tag{1}$$

and

$$\phi(x)=f(x)+\int_a^b K(x,t)\,\phi(t)\,dt. \tag{2}$$

Here the functions $K(x,t)$ and $f(x)$ are known and $\phi(x)$ is the unknown.
$K(x,t)$ is called the kernel or nucleus and is assumed to be a continuous
function of x and t throughout the interval (a, b); that is, $a\le x\le b$,
$a\le t\le b$. In physical problems the kernel is usually Green's function.

Equations (1) and (2) are called linear integral equations because the
unknown function $\phi$ occurs to the first degree. Also, (1) is called a
homogeneous equation and (2) is called a non-homogeneous equation.

An integral equation of the form

$$\phi(x)=f(x)+\int_a^b K(x,t)\,F[t,\phi(t)]\,dt \tag{3}$$

is called a non-linear integral equation because the unknown function $\phi$
does not occur in a linear fashion.

To solve an integral equation of any type is to find the unknown
function $\phi(x)$. In some cases this can be done by the method of iteration,
by starting with an approximate value for $\phi(x)$, substituting it in the
integrand, and performing the integration. The new value of $\phi(x)$ is then
substituted in the integrand as before and the process is repeated until
no improvement is found in $\phi(x)$. In general, however, the solution of
integral equations by exact analytical methods is not easy. Hence it is
necessary to fall back on approximate solutions by numerical methods.



Several methods have been proposed for finding numerical solutions of 
integral equations. One of the simplest and most direct is the method 
suggested by Goursat and later developed and extended in various
directions by Nyström. The method explained in the following pages is a
modification and amplification of Nyström's method. The method consists
essentially in replacing the unknown function under the integral sign by 
a polynomial of some form, integrating this polynomial over an interval, 
and then evaluating the integral at certain specified points within the 
interval of integration. 

Before proceeding to the numerical solution of integral equations, we 
make a short digression to indicate how integral equations can arise from 
simple problems, and particularly how the kernel gets into the equation. 

137. Boundary-Value Problems of Ordinary Differential Equations.
Green's Functions. In the elementary treatment of differential equations
the function and all its derivatives are assumed to be continuous
throughout the interval of integration. The general solution based on these
assumptions is not as general as one might suppose. A few simple examples
will suffice to show this fact.

Example 1. Suppose a solution of the differential equation $d^2y/dx^2=0$
is required such that y = 0 for x = 0 and x = 1. Proceeding by the usual
method, we have

$$\frac{dy}{dx}=c_1,\qquad y=c_1x+c_2.$$

Substituting the conditions given above, we find $c_1=0$ and $c_2=0$. Hence
the solution is

$$y=0,$$

the equation of a straight line through the points (0, 0) and (1, 0).

This solution is trivial and is not the only solution which will satisfy
the given equation and the given conditions. Since the solution must be
of the form $y=Ax+B$, the graph of which is a straight line, it is evident
that a solution might consist of two linear functions whose graphs would
pass through the respective end points and intersect at some point x = s,
as shown in Fig. 37. We therefore attempt to find such a solution.

Let the two linear functions be

$$y_1=Ax \tag{1}$$

and

$$y_2=C(1-x), \tag{2}$$

which evidently satisfy the respective end conditions. A third condition is
that $y_1=y_2$ at x = s. Hence from (1) and (2)

$$As=C(1-s). \tag{3}$$

A fourth condition becomes evident when we look at the graphs in Fig. 37.
The slopes of the two lines are different at the point of intersection, or for
x = s. Hence the first derivative of the required function is discontinuous
at x = s, the amount of the discontinuity (difference of slopes) being
anything we please but evidently depending on s. Call it k(s). Then

$$\frac{dy_1}{dx}-\frac{dy_2}{dx}=A+C=k(s). \tag{4}$$

Solving (3) and (4) simultaneously, we get

$$A=(1-s)\,k(s),\qquad C=s\,k(s),$$

the "arbitrary constants" A and C thus depending on s. Substituting in
(1) and (2) these values of A and C respectively, we get

$$y_1=k(s)(1-s)x,\qquad y_2=k(s)\,s(1-x). \tag{5}$$

In this and similar problems it is customary to put k(s) = 1. Hence
the final solution is

$$y=(1-s)x\ \text{ for }\ 0\le x\le s,\qquad y=s(1-x)\ \text{ for }\ s\le x\le 1. \tag{6}$$


The reader will note that by discarding the usual assumption of continuous
derivatives in this example we have been able to find a worth-while
solution which is everywhere continuous and satisfies the given boundary
conditions. On putting k(s) = 0 in (5) we get the trivial solution first
found. The solution (5) is thus more general than the usual "general"
solution.

The solution (6) may be written as the single equation

$$y=K(x,s),$$

where

$$K(x,s)=(1-s)x\ \text{ for }\ 0\le x\le s,$$
$$K(x,s)=s(1-x)\ \text{ for }\ s\le x\le 1.$$

The function K(x, s) is called Green's Function for this example. It is
a function of the two independent variables x and s in the interval (0, 1)
and is evidently symmetrical in those variables. Green's function in this
case is thus the solution of the differential equation $d^2y/dx^2=0$, with the
given boundary conditions.

Example 2. Required the solution of

$$y''(x)-a^2y=0, \tag{1}$$

with the boundary conditions y = 0 when x = 0 and x = 1.

Solution. By the usual elementary method of solving such equations,
we have

$$r^2=a^2,\quad\text{or}\quad r=\pm a.$$

Hence the general solution is

$$y=c_1e^{ax}+c_2e^{-ax}=A\cosh ax+B\sinh ax. \tag{2}$$

Substituting in (2) the values y = 0, x = 0 and y = 0, x = 1, respectively,
we get

$$0=A,$$
$$0=A\cosh a+B\sinh a,$$
$$\therefore\ B\sinh a=0,\quad\text{or}\quad B=0\ \ (\text{since we assume } a\ne 0).$$

Hence (2) becomes

$$y=0,$$

another trivial solution.

To get a worth-while solution of this example we assume a function of
the form (2) for each end point of the interval x = 0 to x = 1. However,
since $\cosh 0\ne 0$, it is plain that the assumed functions need not contain
$\cosh ax$. Hence we take

$$y_1=A\sinh ax\quad\text{and}\quad y_2=B\sinh a(1-x), \tag{3}$$

where we have now utilized the boundary conditions in writing down these
functions. See Fig. 38. The graphs of these functions will evidently
intersect at some point where x = s, and at that point the functions will
be equal and their first derivatives will be unequal. Hence for x = s we
have from (3),

$$A\sinh as=B\sinh a(1-s), \tag{4}$$

$$\frac{dy_1}{dx}-\frac{dy_2}{dx}=Aa\cosh as+Ba\cosh a(1-s)=1,\ \text{say}. \tag{5}$$

From (4),

$$A=\frac{B\sinh a(1-s)}{\sinh as}. \tag{6}$$

Substituting this value of A in (5), we find

$$B=\frac{\sinh as}{a\sinh a}.$$


Hence from (6),

$$A=\frac{\sinh a(1-s)}{a\sinh a}.$$

Substituting in (3) these values of A and B, we get

$$y_1=\frac{\sinh a(1-s)\,\sinh ax}{a\sinh a},$$

$$y_2=\frac{\sinh as\,\sinh a(1-x)}{a\sinh a}.$$

These can be written as a single solution in the form

$$y=K(s,x),$$

where

$$K(s,x)=\frac{\sinh a(1-s)\,\sinh ax}{a\sinh a}\ \text{ for }\ 0\le x\le s,$$

$$K(s,x)=\frac{\sinh as\,\sinh a(1-x)}{a\sinh a}\ \text{ for }\ s\le x\le 1.$$


The reader will notice that Green’s function takes care of the boundary 
conditions. 
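
The properties used in constructing this Green's function can be checked numerically. The sketch below assumes the two-branch form $K(s,x)=\sinh a(1-s)\sinh ax/(a\sinh a)$ for $0\le x\le s$ and $K(s,x)=\sinh as\,\sinh a(1-x)/(a\sinh a)$ for $s\le x\le 1$; the function names and the finite-difference step are illustrative choices.

```python
import math

def green_sinh(s, x, a):
    """K(s, x) for y'' - a^2 y = 0 on (0, 1) with y(0) = y(1) = 0."""
    if x <= s:
        return math.sinh(a * (1.0 - s)) * math.sinh(a * x) / (a * math.sinh(a))
    return math.sinh(a * s) * math.sinh(a * (1.0 - x)) / (a * math.sinh(a))

def slope_jump(s, a, h=1e-6):
    """Left slope minus right slope at x = s, by one-sided differences."""
    left = (green_sinh(s, s, a) - green_sinh(s, s - h, a)) / h
    right = (green_sinh(s, s + h, a) - green_sinh(s, s, a)) / h
    return left - right

a, s = 2.0, 0.3
boundary_values = (green_sinh(s, 0.0, a), green_sinh(s, 1.0, a))  # both vanish
jump = slope_jump(s, a)                                           # close to 1
symmetry_gap = green_sinh(0.3, 0.7, a) - green_sinh(0.7, 0.3, a)  # close to 0
```

The unit jump in slope corresponds to the choice "= 1, say" made in condition (5) of the example.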

The following example shows how a differential equation with boundary 
conditions can be transformed into an integral equation. 

Example 3. Solve the differential equation

$$\frac{d^2y}{dx^2}=f(x), \tag{1}$$

subject to the conditions that y = 0 when x = a and y = 0 when x = b.

Solution. From (1) we have

$$\frac{dy}{dx}=\int f(x)\,dx+C_1=\int_a^x f(s)\,ds+C_1$$

and

$$y=\int_a^x\left(\int_a^x f(s)\,ds\right)dx+C_1x+C_2. \tag{2}$$

At this point it is well to look at this double integral from a geometric
standpoint.

The double integral $\int_a^x\left(\int_a^x f(s)\,ds\right)dx$ may be looked upon as the
evaluation of the function f(s) over the region (area) ALP of Fig. 39,
by first integrating over the vertical strip MN with respect to s and then
finding the limit of the sum of such strips by integrating with respect to x.
Since x and s are both continuous throughout the region ALP, however,
the double integral may equally well be evaluated over this region by
integrating first over the horizontal strip RQ with respect to x and then
finding the limit of the sum of these strips by integrating with respect to s.

In this latter case the double integral would be $\int_a^x\left(\int_s^x f(s)\,dx\right)ds$.
Hence we have

$$\int_a^x\left(\int_a^x f(s)\,ds\right)dx=\int_a^x\left(\int_s^x dx\right)f(s)\,ds=\int_a^x(x-s)\,f(s)\,ds. \tag{3}$$


It is thus seen that the substitution of an equivalent integral for the given 
one enabled us to perform one integration and thereby reduce a double 
integral to a single integral. 

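Now replacing the integral in (2) by the single integral given in (3),
we have

$$y=\int_a^x(x-s)\,f(s)\,ds+C_1x+C_2. \tag{4}$$

To find $C_1$ and $C_2$ we substitute in (4) the given boundary conditions
x = a, y = 0 and x = b, y = 0. Hence we have

$$0=0+C_1a+C_2,$$
$$0=\int_a^b(b-s)\,f(s)\,ds+C_1b+C_2.$$

Solving these equations for $C_1$ and $C_2$, we get

$$C_1=-\frac{\int_a^b(b-s)\,f(s)\,ds}{b-a},\qquad C_2=\frac{a\int_a^b(b-s)\,f(s)\,ds}{b-a}.$$

Hence (4) now becomes

$$y=\int_a^x(x-s)\,f(s)\,ds-\frac{x\int_a^b(b-s)\,f(s)\,ds}{b-a}+\frac{a\int_a^b(b-s)\,f(s)\,ds}{b-a}
=\int_a^x(x-s)\,f(s)\,ds-\frac{x-a}{b-a}\int_a^b(b-s)\,f(s)\,ds. \tag{5}$$

For the purpose of this example we transform the second integral by the
well-known relation $\int_a^b=\int_a^x+\int_x^b$. Then we have

$$y(x)=\int_a^x(x-s)\,f(s)\,ds-\frac{x-a}{b-a}\int_a^x(b-s)\,f(s)\,ds-\frac{x-a}{b-a}\int_x^b(b-s)\,f(s)\,ds$$

$$=\int_a^x\left(x-s-\frac{(x-a)(b-s)}{b-a}\right)f(s)\,ds+\int_x^b\frac{(x-a)(s-b)}{b-a}\,f(s)\,ds,$$

or

$$y(x)=\int_a^x\frac{(s-a)(x-b)}{b-a}\,f(s)\,ds+\int_x^b\frac{(x-a)(s-b)}{b-a}\,f(s)\,ds, \tag{6}$$

which may be written

$$y(x)=\int_a^b K(x,s)\,f(s)\,ds, \tag{7}$$

where

$$K(x,s)=\frac{(s-a)(x-b)}{b-a}\ \text{ for }\ a\le s\le x,$$

$$K(x,s)=\frac{(x-a)(s-b)}{b-a}\ \text{ for }\ x\le s\le b.$$

Here, again, we note that K(x, s) is symmetrical in x and s.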




The final result (7) is a simple integral equation, and we have thus 
transformed a simple differential equation with boundary conditions into 
a simple integral equation. But we still have not found y; for in (7) 
y is expressed in terms of itself, the f(s) under the integral sign. 

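It is instructive to check the correctness of (6) by differentiation. To
do this we must use the formula for differentiating under the sign of
integration. If

$$F(\alpha)=\int_{X_0}^{X_1}\psi(s,\alpha)\,ds,$$

where the limits $X_0$ and $X_1$ may depend on $\alpha$, then

$$\frac{dF}{d\alpha}=\int_{X_0}^{X_1}\frac{\partial\psi}{\partial\alpha}\,ds+\psi(X_1,\alpha)\frac{dX_1}{d\alpha}-\psi(X_0,\alpha)\frac{dX_0}{d\alpha}.$$

Applying this formula to (6) and treating x as $\alpha$, we have

$$\frac{dy}{dx}=\int_a^x\frac{s-a}{b-a}\,f(s)\,ds+\frac{(x-a)(x-b)}{b-a}\,f(x)+\int_x^b\frac{s-b}{b-a}\,f(s)\,ds-\frac{(x-a)(x-b)}{b-a}\,f(x),$$

the second and fourth terms canceling each other, and

$$\frac{d^2y}{dx^2}=0+\frac{x-a}{b-a}\,f(x)+0-\frac{x-b}{b-a}\,f(x)=\frac{x-a-x+b}{b-a}\,f(x)=\frac{b-a}{b-a}\,f(x)=f(x),$$

which shows that (6) is correct.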
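
The same check can be made numerically for a particular f. The sketch below assumes the kernel $K(x,s)=(s-a)(x-b)/(b-a)$ for $a\le s\le x$ and $K(x,s)=(x-a)(s-b)/(b-a)$ for $x\le s\le b$, and evaluates $y(x)=\int_a^b K(x,s)f(s)\,ds$ by Simpson's Rule on each side of s = x. The helper names are illustrative. With f(s) = 1 on (0, 1) the exact solution of $y''=1$, $y(0)=y(1)=0$ is $y=x(x-1)/2$.

```python
def kernel(x, s, a, b):
    """Green's function for y'' = f(x) with y(a) = y(b) = 0."""
    if s <= x:
        return (s - a) * (x - b) / (b - a)
    return (x - a) * (s - b) / (b - a)

def simpson(g, lo, hi, n=20):
    """Composite Simpson's Rule with an even number n of subintervals."""
    h = (hi - lo) / n
    total = g(lo) + g(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(lo + i * h)
    return total * h / 3.0

def solve_bvp(f, x, a, b):
    """y(x) as the integral of K(x, s) f(s), split at the kink s = x."""
    integrand = lambda s: kernel(x, s, a, b) * f(s)
    return simpson(integrand, a, x) + simpson(integrand, x, b)

y_const = solve_bvp(lambda s: 1.0, 0.3, 0.0, 1.0)   # exact value 0.3*(0.3 - 1)/2
y_linear = solve_bvp(lambda s: s, 0.5, 0.0, 1.0)    # exact value (0.5**3 - 0.5)/6
```

The integral is split at s = x because the kernel has a corner there; on each piece the integrand is a low-degree polynomial for these choices of f, so Simpson's Rule reproduces the exact values to rounding error.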


138. Linear Integral Equations. We shall first consider the linear 
equation 

$$\phi(x)=f(x)+\int_a^b K(x,t)\,\phi(t)\,dt. \tag{1}$$

Since a definite integral can be closely approximated by any one of several
quadrature formulas (each of which was derived by integrating a
polynomial over an interval), it is evident that the definite integral in (1) can
be replaced by a quadrature formula, so that (1) may be written in the form

$$\phi(x)=f(x)+(b-a)\left[C_1K(x,t_1)\phi(t_1)+C_2K(x,t_2)\phi(t_2)+\cdots+C_nK(x,t_n)\phi(t_n)\right], \tag{2}$$

where $t_1,t_2,\ldots,t_n$ are subdivision points of the interval (a, b) and the
C's are weighting coefficients whose values depend on the type of quadrature
formula used. And since (2) must hold for all values of x in the interval
(a, b), it must hold for $x=t_1$, $x=t_2$, ..., $x=t_n$. Hence from (2)
we get n equations of the type

$$\phi(t_i)=f(t_i)+(b-a)\left[C_1K(t_i,t_1)\phi(t_1)+C_2K(t_i,t_2)\phi(t_2)+\cdots+C_nK(t_i,t_n)\phi(t_n)\right],\quad i=1,2,\ldots,n. \tag{3}$$

For brevity let us put $\phi(t_i)=\phi_i$ and $f(t_i)=f_i$. Then the system (3)
becomes

$$\begin{aligned}
\phi_1&=f_1+(b-a)\left[C_1K(t_1,t_1)\phi_1+C_2K(t_1,t_2)\phi_2+\cdots+C_nK(t_1,t_n)\phi_n\right]\\
\phi_2&=f_2+(b-a)\left[C_1K(t_2,t_1)\phi_1+C_2K(t_2,t_2)\phi_2+\cdots+C_nK(t_2,t_n)\phi_n\right]\\
&\;\;\vdots\\
\phi_n&=f_n+(b-a)\left[C_1K(t_n,t_1)\phi_1+C_2K(t_n,t_2)\phi_2+\cdots+C_nK(t_n,t_n)\phi_n\right].
\end{aligned} \tag{4}$$

Equations (4) are a system of n linear equations in the n unknowns
$\phi_1,\phi_2,\ldots,\phi_n$ and can be solved for these unknowns by the usual methods
for solving systems of linear equations. After the $\phi$'s have been found,
they are substituted in the right-hand member of equation (2). The
result is the desired solution of the given equation (1), since (2) will then
give $\phi(x)$ for any value of x.

The reader will note that in a quadrature formula the functional values
$\phi(t_1)$, $\phi(t_2)$, etc., at the points of subdivision are given, whereas in integral
equations these values must be found as part of the process of solving the
equation. We get the equations for finding them by putting $x=t_1$, $t_2$, etc.

Because of the difficulty of solving a large number of simultaneous linear
equations, it is highly desirable that only a small number of functional
values ($\phi$'s) be computed. In important problems the formulas of Gauss
and Lobatto should therefore be used.

Example 1. Solve the integral equation

$$u(x)=\frac{5x}{6}-\frac{1}{9}+\frac{1}{3}\int_0^1(t+x)\,u(t)\,dt.$$

Solution. In this simple example we evaluate the integral by Simpson's
Rule, taking n = 2 or h = 1/2. Then

$$u(x)=\frac{5x}{6}-\frac{1}{9}+\frac{1}{3}\cdot\frac{1}{2}\cdot\frac{1}{3}\left[(t_1+x)u(t_1)+4(t_2+x)u(t_2)+(t_3+x)u(t_3)\right]. \tag{5}$$

Since (5) must hold for all values of x from 0 to 1, it holds for
$x=t_1,t_2,t_3$. Hence from (5) we get

$$u(t_1)=\frac{5t_1}{6}-\frac{1}{9}+\frac{1}{18}\left[2t_1u(t_1)+4(t_2+t_1)u(t_2)+(t_3+t_1)u(t_3)\right],$$

$$u(t_2)=\frac{5t_2}{6}-\frac{1}{9}+\frac{1}{18}\left[(t_1+t_2)u(t_1)+8t_2u(t_2)+(t_3+t_2)u(t_3)\right],$$

$$u(t_3)=\frac{5t_3}{6}-\frac{1}{9}+\frac{1}{18}\left[(t_1+t_3)u(t_1)+4(t_2+t_3)u(t_2)+2t_3u(t_3)\right].$$

Now putting $t_1=0$, $t_2=\tfrac12$, $t_3=1$ and writing $u(t_i)=u_i$, we get

$$u_1=-\frac{1}{9}+\frac{1}{18}\left[2u_2+u_3\right]$$

$$u_2=\frac{5}{12}-\frac{1}{9}+\frac{1}{18}\left[\tfrac12u_1+4u_2+\tfrac32u_3\right]$$

$$u_3=\frac{5}{6}-\frac{1}{9}+\frac{1}{18}\left[u_1+6u_2+2u_3\right].$$

Clearing of fractions and transposing the u's to the left side of the
equations, we have

$$\begin{aligned}
36u_1-4u_2-2u_3&=-4\\
-u_1+28u_2-3u_3&=11\\
-2u_1-12u_2+32u_3&=26.
\end{aligned}$$

On solving these equations by determinants (Cramer's Rule) or otherwise,
we find $u_1=0$, $u_2=\tfrac12$, $u_3=1$. Then substituting in (5) these values
of the u's and putting $t_1=0$, $t_2=\tfrac12$, $t_3=1$, we get

$$u(x)=\frac{5x}{6}-\frac{1}{9}+\frac{1}{18}\left[0+4\left(\tfrac12+x\right)\left(\tfrac12\right)+(1+x)(1)\right]=x.$$

This result can be checked by substituting it in the integrand of the
original equation and performing the integration. Thus, putting u(t) = t,
we have

$$u(x)=\frac{5x}{6}-\frac{1}{9}+\frac{1}{3}\int_0^1(t+x)\,t\,dt=\frac{5x}{6}-\frac{1}{9}+\frac{1}{9}+\frac{x}{6}=x.$$

Simpson's Rule gives the exact result in this example because the integrand
is a second-degree polynomial in t.
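
The procedure of this example can be sketched in a few lines. The following assumes the equation is $u(x)=5x/6-1/9+\tfrac13\int_0^1(t+x)u(t)\,dt$ and uses Simpson's weights 1/6, 4/6, 1/6 at t = 0, 1/2, 1. The helpers `solve_linear`, `nystrom` and `evaluate` are illustrative names, not taken from the text, and exact rational arithmetic is used so that the result can be compared with the values found above.

```python
from fractions import Fraction

def solve_linear(A, rhs):
    """Gauss-Jordan elimination in exact rational arithmetic."""
    n = len(A)
    M = [list(row) + [r] for row, r in zip(A, rhs)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if M[r][col] != 0)
        M[col], M[pivot] = M[pivot], M[col]
        M[col] = [v / M[col][col] for v in M[col]]
        for r in range(n):
            if r != col:
                factor = M[r][col]
                M[r] = [vr - factor * vc for vr, vc in zip(M[r], M[col])]
    return [M[i][n] for i in range(n)]

def nystrom(f, kernel, nodes, weights):
    """Solve u(t_i) = f(t_i) + sum_j w_j K(t_i, t_j) u(t_j) for the u(t_j)."""
    n = len(nodes)
    A = [[(1 if i == j else 0) - weights[j] * kernel(nodes[i], nodes[j])
          for j in range(n)] for i in range(n)]
    return solve_linear(A, [f(t) for t in nodes])

def evaluate(x, f, kernel, nodes, weights, values):
    """Value of u at any x from the quadrature form of the equation."""
    return f(x) + sum(w * kernel(x, t) * u for w, t, u in zip(weights, nodes, values))

f = lambda x: Fraction(5, 6) * x - Fraction(1, 9)
kernel = lambda x, t: Fraction(1, 3) * (t + x)   # the factor 1/3 is absorbed in K
nodes = [Fraction(0), Fraction(1, 2), Fraction(1)]
weights = [Fraction(1, 6), Fraction(4, 6), Fraction(1, 6)]  # Simpson, h = 1/2

u = nystrom(f, kernel, nodes, weights)
u_quarter = evaluate(Fraction(1, 4), f, kernel, nodes, weights, u)
```

Here the weights already include the interval length b - a = 1, so they play the part of the products $(b-a)C_j$ in equation (2) of this article.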

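Example 2. Solve the integral equation

$$u(x)=2x+\frac{1}{5}\int_{-2}^{3}(x-t)\,u(t)\,dt \tag{6}$$

by Gauss's formula, using three points of subdivision.

Solution. We must first transform the equation so that the new limits
of integration will be from $-\tfrac12$ to $\tfrac12$. Hence we put

$$t=a+bv,\qquad x=a+bw.$$

Substituting in the first of these equations the corresponding limits $t=-2$,
$v=-\tfrac12$ and $t=3$, $v=\tfrac12$, we get $a=\tfrac12$, $b=5$. Hence the equation of
transformation is

$$t=\tfrac12+5v.$$

Likewise,

$$x=\tfrac12+5w.$$

Substituting in (6) these values of t and x, we have

$$u\left(\tfrac12+5w\right)=1+10w+\frac{1}{5}\int_{-1/2}^{1/2}5(w-v)\,u\left(\tfrac12+5v\right)5\,dv.$$

Now put $u(\tfrac12+5w)=\phi(w)$, $u(\tfrac12+5v)=\phi(v)$. Then

$$\phi(w)=1+10w+5\int_{-1/2}^{1/2}(w-v)\,\phi(v)\,dv.$$

Replacing the integral by Gauss's formula, we have

$$\phi(w)=1+10w+5\left[R_1(w-v_1)\phi(v_1)+R_2(w-v_2)\phi(v_2)+R_3(w-v_3)\phi(v_3)\right]. \tag{7}$$

Now since (7) must hold for all values of w from $-\tfrac12$ to $\tfrac12$, it must hold
for $w=v_1$, $w=v_2$, $w=v_3$. Hence on substituting in (7) these values
for w we get the equations

$$\begin{aligned}
\phi(v_1)&=1+10v_1+5\left[R_2(v_1-v_2)\phi(v_2)+R_3(v_1-v_3)\phi(v_3)\right]\\
\phi(v_2)&=1+10v_2+5\left[R_1(v_2-v_1)\phi(v_1)+R_3(v_2-v_3)\phi(v_3)\right]\\
\phi(v_3)&=1+10v_3+5\left[R_1(v_3-v_1)\phi(v_1)+R_2(v_3-v_2)\phi(v_2)\right].
\end{aligned}$$

For the Gauss formula with three points of subdivision we have

$$v_1=-\tfrac12\sqrt{\tfrac35},\quad v_2=0,\quad v_3=\tfrac12\sqrt{\tfrac35},\quad R_1=\tfrac{5}{18},\quad R_2=\tfrac49,\quad R_3=\tfrac{5}{18}.$$

On substituting these values in the equations above and writing $\phi_1$ for
$\phi(v_1)$ etc., we get

$$\begin{aligned}
\phi_1&=1-\sqrt{15}+5\left[\tfrac49\left(-\tfrac12\sqrt{\tfrac35}\right)\phi_2+\tfrac{5}{18}\left(-\sqrt{\tfrac35}\right)\phi_3\right]\\
\phi_2&=1+5\left[\tfrac{5}{18}\left(\tfrac12\sqrt{\tfrac35}\right)\phi_1+\tfrac{5}{18}\left(-\tfrac12\sqrt{\tfrac35}\right)\phi_3\right]\\
\phi_3&=1+\sqrt{15}+5\left[\tfrac{5}{18}\left(\sqrt{\tfrac35}\right)\phi_1+\tfrac49\left(\tfrac12\sqrt{\tfrac35}\right)\phi_2\right]
\end{aligned} \tag{8}$$

or

$$\begin{aligned}
\phi_1+\tfrac29\sqrt{15}\,\phi_2+\tfrac{5}{18}\sqrt{15}\,\phi_3&=1-\sqrt{15}\\
-\tfrac{5}{36}\sqrt{15}\,\phi_1+\phi_2+\tfrac{5}{36}\sqrt{15}\,\phi_3&=1\\
\tfrac{5}{18}\sqrt{15}\,\phi_1+\tfrac29\sqrt{15}\,\phi_2-\phi_3&=-1-\sqrt{15}.
\end{aligned} \tag{9}$$

Solving these equations by determinants, we find

$$\phi_1=-\frac{2}{37}\left(19+9\sqrt{15}\right),\qquad\phi_2=-\frac{38}{37},\qquad\phi_3=-\frac{2}{37}\left(19-9\sqrt{15}\right).$$

As a partial check on the correctness of these results we notice that by
adding the first and third of equations (8) and comparing the sum with
the second equation we get $\phi_1+\phi_3=2\phi_2$, and this is true of the values
found above.

Now substituting in (7) these values of the $\phi$'s and the numerical values
of the R's and v's, we get

$$\phi(w)=\frac{180w}{37}-\frac{38}{37}. \tag{10}$$

Since $w=\dfrac{x-\tfrac12}{5}$, the final solution of (6) is

$$u(x)=\frac{180}{37}\cdot\frac{\left(x-\tfrac12\right)}{5}-\frac{38}{37}=\frac{36x}{37}-\frac{56}{37}=\frac{4}{37}(9x-14).$$

This solution can be checked by substituting it in the original equation
(6), as shown below:

$$u(x)=2x+\frac{1}{5}\int_{-2}^{3}(x-t)\,\frac{4}{37}(9t-14)\,dt
=2x+\frac{4}{185}\int_{-2}^{3}\left(9xt-14x-9t^2+14t\right)dt$$

$$=2x+\frac{4}{185}\left[\frac{9xt^2}{2}-14xt-3t^3+7t^2\right]_{-2}^{3}
=2x-\frac{38x}{37}-\frac{56}{37}=\frac{36x}{37}-\frac{56}{37}=\frac{4}{37}(9x-14).$$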
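
The computation in this example can be repeated in floating-point arithmetic. The sketch below assumes the transformed equation $\phi(w)=1+10w+5\int_{-1/2}^{1/2}(w-v)\phi(v)\,dv$ and the three-point Gauss values $v=\mp\tfrac12\sqrt{3/5},\,0$ with weights 5/18, 4/9, 5/18; the helper names are illustrative.

```python
import math

def solve_linear(A, rhs):
    """Gauss-Jordan elimination with partial pivoting."""
    n = len(A)
    M = [list(row) + [r] for row, r in zip(A, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        M[col] = [v / M[col][col] for v in M[col]]
        for r in range(n):
            if r != col:
                factor = M[r][col]
                M[r] = [vr - factor * vc for vr, vc in zip(M[r], M[col])]
    return [M[i][n] for i in range(n)]

# Three-point Gauss points and weights on (-1/2, 1/2); the weights sum to 1.
c = 0.5 * math.sqrt(3.0 / 5.0)
V = [-c, 0.0, c]
R = [5.0 / 18.0, 4.0 / 9.0, 5.0 / 18.0]

def free_term(w):
    return 1.0 + 10.0 * w

# Collocate phi(w) = 1 + 10 w + 5 * sum_j R_j (w - v_j) phi(v_j) at w = v_i.
A = [[(1.0 if i == j else 0.0) - 5.0 * R[j] * (V[i] - V[j]) for j in range(3)]
     for i in range(3)]
PHI = solve_linear(A, [free_term(v) for v in V])

def u(x):
    """Solution of the original equation, using w = (x - 1/2)/5."""
    w = (x - 0.5) / 5.0
    return free_term(w) + 5.0 * sum(R[j] * (w - V[j]) * PHI[j] for j in range(3))
```

Because the integrand $(w-v)\phi(v)$ is only of the second degree in v when $\phi$ is linear, the three-point Gauss formula integrates it without error, which is why the computed values agree with $u(x)=\tfrac{4}{37}(9x-14)$ to rounding error.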




139. Non-Linear Integral Equations and Boundary-Value Problems.
Many boundary-value problems lead to non-linear integral equations of the
type (3), Art. 136. The unknown function $\phi(t)$ may enter through any
function of $\phi(t)$, such as $\sin\phi(t)$, etc. In such problems the $\phi$'s
cannot be found by solving a system of simple linear equations as was done
in the examples worked in Art. 138. It is possible, however, to find a
system of equations which give the $\phi$'s in terms of the functions $F[t,\phi(t)]$.
The $\phi$'s can then be found by the process of iteration as was explained in
Art. 80.

Before proceeding with the solution of such non-linear integral equations
we return to a further discussion of boundary-value problems. We
consider first the second-order differential equation

$$y''(x)=\frac{d^2y}{dx^2}=\lambda A(x)\,y+f(x), \tag{1}$$

with the boundary conditions y = 0 when x = a and y = 0 when x = b.
These conditions are usually written y(a) = y(b) = 0. The functions
A(x) and f(x) are assumed to be continuous in the interval (a, b), and
$\lambda$ is an arbitrary constant or parameter.

Equation (1) can be reduced by direct integration* to the integral
equation

$$y(x)=\lambda\int_a^b K(x,s)\,A(s)\,y(s)\,ds+\int_a^b K(x,s)\,f(s)\,ds, \tag{2}$$

where

$$K(x,s)=\frac{(s-a)(x-b)}{b-a}\ \text{ for }\ a\le s\le x,$$

$$K(x,s)=\frac{(x-a)(s-b)}{b-a}\ \text{ for }\ x\le s\le b,$$

but we shall here simply verify by differentiation that (2) is the solution
of (1). Since K(x, s) has two different values, we write (2) in the
equivalent and extended form

$$y(x)=\int_a^x\frac{(s-a)(x-b)}{b-a}\left[\lambda A(s)y(s)+f(s)\right]ds+\int_x^b\frac{(x-a)(s-b)}{b-a}\left[\lambda A(s)y(s)+f(s)\right]ds.$$

* Lovitt's Linear Integral Equations, p. 82; Goursat's Cours d'Analyse, III, p. 494.

Now applying the formula for differentiating under the sign, we have

$$y'(x)=\int_a^x\frac{s-a}{b-a}\left[\lambda A(s)y(s)+f(s)\right]ds+\frac{(x-a)(x-b)}{b-a}\left[\lambda A(x)y(x)+f(x)\right]$$

$$\qquad+\int_x^b\frac{s-b}{b-a}\left[\lambda A(s)y(s)+f(s)\right]ds-\frac{(x-a)(x-b)}{b-a}\left[\lambda A(x)y(x)+f(x)\right],$$

the integrated terms canceling each other, and

$$y''(x)=0+\frac{x-a}{b-a}\left[\lambda A(x)y(x)+f(x)\right]+0-\frac{x-b}{b-a}\left[\lambda A(x)y(x)+f(x)\right]$$

$$=\left[\lambda A(x)y(x)+f(x)\right]\left(\frac{x-a}{b-a}-\frac{x-b}{b-a}\right)=\left[\lambda A(x)y(x)+f(x)\right]\left(\frac{b-a}{b-a}\right),$$

or

$$y''(x)=\lambda A(x)\,y(x)+f(x),$$

which is Equation (1).

Let us now consider a more general boundary problem defined by the
differential equation

$$\frac{d^2\phi}{dx^2}=\phi''(x)=F[x,\phi(x)]+g(x), \tag{3}$$

where $F[x,\phi(x)]$ represents any continuous function, g(x) is a given
function, and the boundary conditions are $\phi(a)=\phi(b)=0$. This
differential equation with the given boundary conditions is equivalent to the
integral equation

$$\phi(x)=\int_a^b K(x,s)\,F[s,\phi(s)]\,ds+\int_a^b K(x,s)\,g(s)\,ds, \tag{4}$$

where

$$K(x,s)=\frac{(s-a)(x-b)}{b-a}\ \text{ for }\ a\le s\le x,$$

$$K(x,s)=\frac{(x-a)(s-b)}{b-a}\ \text{ for }\ x\le s\le b.$$

We shall verify this fact by direct differentiation of (4).
Writing (4) in the equivalent form

$$\phi(x)=\int_a^x\frac{(s-a)(x-b)}{b-a}\left(F[s,\phi(s)]+g(s)\right)ds+\int_x^b\frac{(x-a)(s-b)}{b-a}\left(F[s,\phi(s)]+g(s)\right)ds,$$

we differentiate the equation with respect to x, as in the preceding example.
Thus

$$\phi'(x)=\int_a^x\frac{s-a}{b-a}\left(F[s,\phi(s)]+g(s)\right)ds+\frac{(x-a)(x-b)}{b-a}\left(F[x,\phi(x)]+g(x)\right)$$

$$\qquad+\int_x^b\frac{s-b}{b-a}\left(F[s,\phi(s)]+g(s)\right)ds-\frac{(x-a)(x-b)}{b-a}\left(F[x,\phi(x)]+g(x)\right),$$

the integrated terms canceling each other, and

$$\phi''(x)=0+\frac{x-a}{b-a}\left(F[x,\phi(x)]+g(x)\right)+0-\frac{x-b}{b-a}\left(F[x,\phi(x)]+g(x)\right)$$

$$=\left(F[x,\phi(x)]+g(x)\right)\left(\frac{x-a}{b-a}-\frac{x-b}{b-a}\right)=\left(F[x,\phi(x)]+g(x)\right)\left(\frac{b-a}{b-a}\right)=F[x,\phi(x)]+g(x),$$

as was to be shown. We therefore solve the differential equation (3) by
solving the equivalent integral equation (4).

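Since it is desirable to have a small number of equations for determining
the $\phi$'s, and yet a high degree of accuracy is desirable, we restrict the
function $F[x,\phi(x)]$ to the cases where $F[a,\phi(a)]=0$ and $F[b,\phi(b)]=0$.
This restriction enables us to employ the subdivision points called for in
Lobatto's formula for five functional values, thus giving a high degree of
accuracy and yet causing the two end values to drop out. We thus have
only three interior points to consider and therefore only three equations
for determining the $\phi$'s. Although we can use the subdivision points
required by Lobatto's formula, we cannot use the quadrature formula itself,
because that formula was derived by integrating a single function
throughout the interval from x = a to x = b, whereas in the problem before us
there are two different functions. We therefore replace the function
$F[x,\phi(x)]$ over the interval (a, b) by a fourth-degree polynomial given
by Lagrange's interpolation formula.

To adapt the interval of integration as required for Lobatto's points of
subdivision, we must change the limits of integration to $-\tfrac12$ and $\tfrac12$, as
was done in Ex. 2, Art. 138. We assume that the interval has thus been
changed. The subdivision points for Lobatto's formula are then

$$s_1=0,\quad s_2=-\tfrac12\sqrt{\tfrac37},\quad s_3=\tfrac12\sqrt{\tfrac37},\quad s_4=-\tfrac12,\quad s_5=\tfrac12.$$

Lagrange's formula for $F[s,\phi(s)]$ for these five functional values is
therefore

$$F[s,\phi(s)]=\frac{\left(s+\tfrac12\sqrt{\tfrac37}\right)\left(s-\tfrac12\sqrt{\tfrac37}\right)\left(s+\tfrac12\right)\left(s-\tfrac12\right)}{\left(\tfrac12\sqrt{\tfrac37}\right)\left(-\tfrac12\sqrt{\tfrac37}\right)\left(\tfrac12\right)\left(-\tfrac12\right)}\,F[0,\phi_1]$$

$$\qquad+\frac{(s)\left(s-\tfrac12\sqrt{\tfrac37}\right)\left(s+\tfrac12\right)\left(s-\tfrac12\right)}{\left(-\tfrac12\sqrt{\tfrac37}\right)\left(-\tfrac12\sqrt{\tfrac37}-\tfrac12\sqrt{\tfrac37}\right)\left(-\tfrac12\sqrt{\tfrac37}+\tfrac12\right)\left(-\tfrac12\sqrt{\tfrac37}-\tfrac12\right)}\,F\left[-\tfrac12\sqrt{\tfrac37},\phi_2\right]$$

$$\qquad+\frac{(s)\left(s+\tfrac12\sqrt{\tfrac37}\right)\left(s+\tfrac12\right)\left(s-\tfrac12\right)}{\left(\tfrac12\sqrt{\tfrac37}\right)\left(\tfrac12\sqrt{\tfrac37}+\tfrac12\sqrt{\tfrac37}\right)\left(\tfrac12\sqrt{\tfrac37}+\tfrac12\right)\left(\tfrac12\sqrt{\tfrac37}-\tfrac12\right)}\,F\left[\tfrac12\sqrt{\tfrac37},\phi_3\right], \tag{5}$$

the terms in $F[-\tfrac12,\phi(-\tfrac12)]$ and $F[\tfrac12,\phi(\tfrac12)]$ not being written because
they are zero. When the terms in (5) are multiplied out, we get

$$F[s,\phi(s)]=\frac{112}{3}\left(s^4-\frac{5s^2}{14}+\frac{3}{112}\right)F[0,\phi_1]
-\frac{98}{3}\left(s^4-\frac{s^3}{2}\sqrt{\tfrac37}-\frac{s^2}{4}+\frac{s}{8}\sqrt{\tfrac37}\right)F\left[-\tfrac12\sqrt{\tfrac37},\phi_2\right]$$

$$\qquad-\frac{98}{3}\left(s^4+\frac{s^3}{2}\sqrt{\tfrac37}-\frac{s^2}{4}-\frac{s}{8}\sqrt{\tfrac37}\right)F\left[\tfrac12\sqrt{\tfrac37},\phi_3\right]. \tag{6}$$

Equation (6) gives $F[-\tfrac12,\phi(-\tfrac12)]=0$ and $F[\tfrac12,\phi(\tfrac12)]=0$ as it should.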

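The next step is to substitute (6) in (4) and then integrate over the
intervals $-\tfrac12$ to x and x to $\tfrac12$, using the appropriate value of K(x, s) in
each case. Thus, since a is now $-\tfrac12$ and b is $\tfrac12$, we have

$$\phi(x)=\int_{-1/2}^{x}\left(x-\tfrac12\right)\left(s+\tfrac12\right)F[s,\phi(s)]\,ds+\int_{x}^{1/2}\left(x+\tfrac12\right)\left(s-\tfrac12\right)F[s,\phi(s)]\,ds+f(x), \tag{7}$$

where f(x) stands for $\int_{-1/2}^{1/2}K(x,s)\,g(s)\,ds$ and $F[s,\phi(s)]$ stands for the
right-hand member of (6).

In carrying out the integration in (7), x is treated as a constant; and
although the integration is perfectly straightforward, it is long and tedious.
The result is

$$\phi(x)=\frac{112}{3}\,F[0,\phi_1]\left(\frac{x^6}{30}-\frac{5x^4}{168}+\frac{3x^2}{224}-\frac{9}{4480}\right)$$

$$\qquad-\frac{98}{3}\,F\left[-\tfrac12\sqrt{\tfrac37},\phi_2\right]\left(\frac{x^6}{30}-\frac{x^5}{40}\sqrt{\tfrac37}-\frac{x^4}{48}+\frac{x^3}{48}\sqrt{\tfrac37}-\frac{7x}{1920}\sqrt{\tfrac37}+\frac{1}{1280}\right)$$

$$\qquad-\frac{98}{3}\,F\left[\tfrac12\sqrt{\tfrac37},\phi_3\right]\left(\frac{x^6}{30}+\frac{x^5}{40}\sqrt{\tfrac37}-\frac{x^4}{48}-\frac{x^3}{48}\sqrt{\tfrac37}+\frac{7x}{1920}\sqrt{\tfrac37}+\frac{1}{1280}\right)+f(x). \tag{8}$$

This formula (8) gives the complete solution of the integral equation
(4), or the differential equation (3), for any function $F[x,\phi(x)]$ as soon
as $\phi_1$, $\phi_2$, $\phi_3$ are known.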

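To find these $\phi$'s we evaluate (8) for $x=0$, $x=-\tfrac12\sqrt{\tfrac37}$, and $x=\tfrac12\sqrt{\tfrac37}$.
Denoting $\phi(x_1)$ by $\phi_1$, $f(x_1)$ by $f_1$, etc., and doing some
tedious arithmetic, we get

$$\begin{aligned}
\phi_1&=-\tfrac{3}{40}\,F(0,\phi_1)-\tfrac{49}{1920}\,F\left(-\tfrac12\sqrt{\tfrac37},\phi_2\right)-\tfrac{49}{1920}\,F\left(\tfrac12\sqrt{\tfrac37},\phi_3\right)+f_1\\
\phi_2&=-\tfrac{16}{490}\,F(0,\phi_1)-\tfrac{13}{420}\,F\left(-\tfrac12\sqrt{\tfrac37},\phi_2\right)-\tfrac{1}{140}\,F\left(\tfrac12\sqrt{\tfrac37},\phi_3\right)+f_2\\
\phi_3&=-\tfrac{16}{490}\,F(0,\phi_1)-\tfrac{1}{140}\,F\left(-\tfrac12\sqrt{\tfrac37},\phi_2\right)-\tfrac{13}{420}\,F\left(\tfrac12\sqrt{\tfrac37},\phi_3\right)+f_3
\end{aligned} \tag{9}$$

or, with decimal coefficients,

$$\begin{aligned}
\phi_1&=-0.075000000\,F(0,\phi_1)-0.025520833\,F\left(-\tfrac12\sqrt{\tfrac37},\phi_2\right)-0.025520833\,F\left(\tfrac12\sqrt{\tfrac37},\phi_3\right)+f_1\\
\phi_2&=-0.032653061\,F(0,\phi_1)-0.030952381\,F\left(-\tfrac12\sqrt{\tfrac37},\phi_2\right)-0.007142857\,F\left(\tfrac12\sqrt{\tfrac37},\phi_3\right)+f_2\\
\phi_3&=-0.032653061\,F(0,\phi_1)-0.007142857\,F\left(-\tfrac12\sqrt{\tfrac37},\phi_2\right)-0.030952381\,F\left(\tfrac12\sqrt{\tfrac37},\phi_3\right)+f_3.
\end{aligned} \tag{10}$$

Since f(x) can be easily found in any given problem, the $\phi$'s can be
found from (9) or (10) by the process of iteration if approximate values
are known at the start. Then the desired solution of (4) is found by
substituting the $\phi$'s in (8).

The reader should observe that in the special case where $F[x,\phi(x)]=\phi(x)$,
equations (9) and (10) will give the $\phi$'s in a system of linear
equations as in Art. 138. A differential equation of the form

$$\phi''(x)=\phi(x)+g(x),$$

with $\phi(a)=\phi(b)=0$, can thus be solved by means of (9) or (10).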

Example. Solve the differential equation

$$\phi''(x)=\sin\phi(x)-1, \tag{11}$$

with the boundary conditions $\phi(-\tfrac12)=\phi(\tfrac12)=0$.

Solution. Here $F[x,\phi(x)]=\sin\phi(x)$ and $g(x)=-1$. Since $\phi(\pm\tfrac12)=0$,
$\sin\phi(\pm\tfrac12)=0$ and we may therefore utilize equations (9) or (10).

To get approximate values for $\phi$ we assume that the solution of (11) is
not very different from the solution of the similar equation

$$\phi''(x)=\phi(x)-1, \tag{12}$$

or

$$\phi''-\phi=-1. \tag{13}$$

Solving this by the usual elementary method, we have

$$r^2-1=0\quad\text{or}\quad r=\pm1.$$

Hence the complementary function is

$$\phi=A\cosh x+B\sinh x.$$

To find a particular integral we assume $\phi=C$. Hence $\phi'=0$, $\phi''=0$.
Substituting these in (13), we get

$$-C=-1\quad\text{or}\quad C=1.$$

Hence the general solution of (12) is

$$\phi=A\cosh x+B\sinh x+1. \tag{14}$$

Now substituting in (14) the given boundary conditions $\phi(\pm\tfrac12)=0$,
we have

$$0=A\cosh\tfrac12-B\sinh\tfrac12+1$$
$$0=A\cosh\tfrac12+B\sinh\tfrac12+1.$$

Solving these for A and B, we get

$$A=-\frac{1}{\cosh\tfrac12},\qquad B=0.$$

Then (14) becomes

$$\phi=1-\frac{\cosh x}{\cosh\tfrac12}. \tag{15}$$


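We now find $\phi_1$, $\phi_2$, $\phi_3$ by putting $x_1=0$, $x_2=-\tfrac12\sqrt{\tfrac37}$, and $x_3=\tfrac12\sqrt{\tfrac37}$
in (15). Then

$$\phi_1=1-\frac{\cosh 0}{\cosh 0.5}=1-\frac{1}{1.127625965}=0.11318,$$

$$\phi_2=1-\frac{\cosh\left(-\tfrac12\sqrt{\tfrac37}\right)}{\cosh 0.5}=1-\frac{\cosh 0.3273268}{\cosh 0.5}=1-\frac{1.054051446}{1.127625965}=0.065247,$$

$$\phi_3=1-\frac{\cosh\left(\tfrac12\sqrt{\tfrac37}\right)}{\cosh 0.5}=0.065247.$$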

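The next step is to evaluate the integral giving f(x). We have

$$f(x)=\int_{-1/2}^{1/2}K(x,s)\,g(s)\,ds.$$

Since $g(x)=-1$, we have

$$f(x)=-\int_{-1/2}^{x}\left(x-\tfrac12\right)\left(s+\tfrac12\right)ds-\int_{x}^{1/2}\left(x+\tfrac12\right)\left(s-\tfrac12\right)ds=\frac{1}{8}-\frac{x^2}{2}.$$

Hence

$$f_1=f(0)=\frac{1}{8},\qquad f_2=f_3=f\left(\pm\tfrac12\sqrt{\tfrac37}\right)=\frac{1}{8}-\frac{1}{2}\left(\frac{3}{28}\right)=\frac{1}{14}.$$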

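Since $F[x,\phi(x)]=\sin\phi(x)$, equations (9) become

$$\begin{aligned}
\phi_1&=-\tfrac{3}{40}\sin\phi_1-\tfrac{49}{1920}\sin\phi_2-\tfrac{49}{1920}\sin\phi_3+f_1\\
\phi_2&=-\tfrac{16}{490}\sin\phi_1-\tfrac{13}{420}\sin\phi_2-\tfrac{1}{140}\sin\phi_3+f_2\\
\phi_3&=-\tfrac{16}{490}\sin\phi_1-\tfrac{1}{140}\sin\phi_2-\tfrac{13}{420}\sin\phi_3+f_3.
\end{aligned} \tag{16}$$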


We now substitute in the right-hand member of the first equation of (16)
the numerical value of $f_1$ and the approximate values of $\phi_1$, $\phi_2$, $\phi_3$ found
above. Then we have

$$\phi_1=-\tfrac{3}{40}\sin(0.11318)-\tfrac{49}{1920}\sin(0.065247)-\tfrac{49}{1920}\sin(0.065247)+0.125000$$
$$=-0.0084704-0.0033279+0.1250000=0.11320.$$

Now substituting this value in the second equation of (16), we get

$$\phi_2=-\tfrac{16}{490}\sin(0.11320)-\tfrac{13}{420}\sin(0.065247)-\tfrac{1}{140}\sin(0.065247)+0.07142857$$
$$=-0.0036884-0.0020181-0.0004657+0.07142857=0.065256.$$

Also,

$$\phi_3=0.065256.$$



We now repeat the above process by substituting $\phi_1=0.11320$,
$\phi_2=\phi_3=0.065256$ in
the right-hand member of the first of equations (16). Then

$$\phi_1=-\tfrac{3}{40}\sin(0.11320)-\tfrac{49}{1920}\sin(0.065256)-\tfrac{49}{1920}\sin(0.065256)+0.125000$$
$$=-0.0084719-0.0033284+0.1250000=0.11320.$$

Now substituting this value in the second of equations (16), we have

$$\phi_2=-\tfrac{16}{490}\sin(0.11320)-\tfrac{13}{420}\sin(0.065256)-\tfrac{1}{140}\sin(0.065256)+0.07142857,$$

or

$$\phi_2=-0.0036884-0.0020184-0.0004658+0.07142857=0.065256;$$

and

$$\phi_3=0.065256.$$

Since these values of the $\phi$'s are the same as the preceding set of values,
we take them to be correct.

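The required solution of equation (11) is now obtained by substituting
in (8) the values of $\phi$ and f(x) found above. Thus, since $f(x)=\tfrac18-\tfrac{x^2}{2}$
and $F[0,\phi_1]$, $F\left[-\tfrac12\sqrt{\tfrac37},\phi_2\right]$, $F\left[\tfrac12\sqrt{\tfrac37},\phi_3\right]$ are to be replaced by
$\sin\phi_1$, $\sin\phi_2$, and $\sin\phi_3$, respectively, we have

$$\phi(x)=\frac{112}{3}(0.112958)\left[\frac{x^6}{30}-\frac{5x^4}{168}+\frac{3x^2}{224}-\frac{9}{4480}\right]$$

$$\qquad-\frac{98}{3}\sin(0.065256)\left[\frac{x^6}{30}-\frac{x^5}{40}\sqrt{\tfrac37}-\frac{x^4}{48}+\frac{x^3}{48}\sqrt{\tfrac37}-\frac{7x}{1920}\sqrt{\tfrac37}+\frac{1}{1280}\right]$$

$$\qquad-\frac{98}{3}\sin(0.065256)\left[\frac{x^6}{30}+\frac{x^5}{40}\sqrt{\tfrac37}-\frac{x^4}{48}-\frac{x^3}{48}\sqrt{\tfrac37}+\frac{7x}{1920}\sqrt{\tfrac37}+\frac{1}{1280}\right]+\frac{1}{8}-\frac{x^2}{2},$$

or

$$\phi(x)=0.11320-0.44352x^2-0.03675x^4-0.001443x^6. \tag{17}$$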



As a partial check on this result we find by substituting $x=-\tfrac12$ and
$x=\tfrac12$ in (17) that $\phi(\pm\tfrac12)=0$, as it should be. To check the result
completely, it is necessary to go back to equation (6) and replace $F(0,\phi_1)$,
$F\left(-\tfrac12\sqrt{\tfrac37},\phi_2\right)$, and $F\left(\tfrac12\sqrt{\tfrac37},\phi_3\right)$ by $\sin\phi_1$, $\sin\phi_2$, and $\sin\phi_3$,
respectively. The result is then substituted in equation (4). The integration
is then carried out over the intervals $-\tfrac12$ to x and x to $\tfrac12$, the appropriate
value of K(x, s) being used in each case. Note that the second integral
in (4) is $f(x)=\tfrac18-\tfrac{x^2}{2}$, which has already been found. This complete
check has been carried out for this problem, the right-hand member of
(4) giving the right-hand member of (17) above.
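
The iteration used in this example is easily mechanized. The sketch below assumes equations (16) with the coefficients 3/40, 49/1920, 16/490, 13/420, 1/140 and the values $f_1=\tfrac18$, $f_2=f_3=\tfrac1{14}$, and starts from $\phi=1-\cosh x/\cosh\tfrac12$. The names `COEF`, `F_VALUES`, `starting_values` and `iterate` are illustrative only.

```python
import math

# Coefficients of sin(phi_j) in equations (16); row i belongs to phi_i.
COEF = [
    [3 / 40, 49 / 1920, 49 / 1920],
    [16 / 490, 13 / 420, 1 / 140],
    [16 / 490, 1 / 140, 13 / 420],
]
F_VALUES = [1 / 8, 1 / 14, 1 / 14]  # f_1, f_2, f_3

def starting_values():
    """phi = 1 - cosh(x)/cosh(1/2) at x = 0 and x = -/+ (1/2) sqrt(3/7)."""
    c = 0.5 * math.sqrt(3 / 7)
    return [1 - math.cosh(x) / math.cosh(0.5) for x in (0.0, -c, c)]

def iterate(phi, tol=1e-10, max_sweeps=50):
    """Repeat equations (16) until the values stop changing."""
    phi = list(phi)
    for _ in range(max_sweeps):
        old = list(phi)
        for i in range(3):
            # Each new value is used at once in the following equations.
            phi[i] = F_VALUES[i] - sum(COEF[i][j] * math.sin(phi[j]) for j in range(3))
        if max(abs(p - q) for p, q in zip(phi, old)) < tol:
            break
    return phi

start = starting_values()
phi = iterate(start)
```

The coefficients are small compared with unity, so each sweep reduces the error by a large factor and the values settle after only a few repetitions, as was found above.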





CHAPTER XVI 


THE NORMAL LAW OF ERROR AND THE 
PRINCIPLE OF LEAST SQUARES 

140. Errors of Observation and Measurement. All measurements are
subject to three kinds of errors: constant or systematic errors, mistakes,
and accidental errors. Systematic errors are those which affect all
measurements alike. They are mostly due to imperfections in the construction
or adjustment of instruments, the "personal equation" of the observer,
etc. Such errors are usually determinate and can be remedied by applying
the proper corrections.

Mistakes or blunders are large errors due to careless reading of measuring 
instruments or faulty recording of the readings. They consist mostly in 
reading the wrong scale, reading a vernier backward, making a miscount 
in observations which involve counting, putting down the wrong number 
when recording the readings, etc. Mistakes do not follow any law and 
can be avoided or remedied only by constant vigilance and careful checking 
on the part of the observer. 

Accidental errors are those whose causes are unknown and indeterminate. 
They are usually small, and they follow the laws of chance. The mathe- 
matical theory of errors deals with accidental errors only. 

141. The Law of Accidental Errors. In order to get a better under- 
standing of the behavior of accidental errors the reader should try the 
following experiment: 

Take a sheet of ruled paper and draw with pen or pencil a line bisecting 
the space between two rulings near the middle of the sheet, as shown in 
Fig. 40. Lay the sheet flat on a table or floor, with the rulings upward. 
Now take a sharp pointed pencil, hold it lightly by the top between the 
finger tips of both hands, and about two feet above the paper. Take good 
aim at the line on the paper and try to hit it by dropping the pencil on it. 
Drop the pencil in this way at least 100 times, making an honest effort to 
hit the line every time. The shots will be self-recorded as dots on the 
paper. Count the dots in the compartment (space between the rulings) 
containing the target line, and the number in each of the other com- 
partments on each side of the central one. Plot a curve by using as 
abscissas the distances from the target line to the midpoints of the 
several compartments containing dots, and as ordinates the number of dots 
in the corresponding compartments. 




An experiment of this kind gave the results recorded in the table below.
These results are plotted in Fig. 41.

Compartment    No. of dots
     3               1
     2               6
     1              31
     0              53
    -1              32
    -2               6
    -3               1
   Total           130










If the pencil had been dropped 10,000 or more times instead of 130
and the width of the compartments correspondingly decreased, the plotted
points would have followed the curve shown in Fig. 41. This curve is
known as the Normal Probability Curve. Its equation will be derived in
Art. 143.

All kinds of accidental errors follow the same law as the pencil shots 
in this experiment. 
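
The experiment can also be imitated numerically. The following Python sketch is only an illustration: it assumes that each shot misses the line by a normally distributed distance with a spread `sigma` of 0.8 compartment widths (an arbitrary value, not one taken from the text), so it presupposes the law it illustrates rather than proving it. The function name `drop_pencil` and the seed are hypothetical.

```python
import random
from collections import Counter

def drop_pencil(n_shots, sigma=0.8, seed=0):
    """Simulate n_shots aimed at the target line; return dots per compartment.

    Compartment k covers distances from k - 0.5 to k + 0.5 ruling widths.
    sigma (spread of a shot, in ruling widths) is an assumed value.
    """
    rng = random.Random(seed)
    counts = Counter()
    for _ in range(n_shots):
        miss = rng.gauss(0.0, sigma)   # signed distance from the target line
        counts[round(miss)] += 1       # index of the nearest compartment
    return dict(sorted(counts.items()))

counts = drop_pencil(130)
for k, n in counts.items():
    print(f"{k:+d} {'*' * n}")         # crude plot: one star per dot
```

With a larger `n_shots` the printed profile smooths out toward the bell shape of the curve.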

142. The Probability of Errors Lying between Given Limits. In
many applications of the theory of probability it is necessary to find the 
chance that a given error will lie within certain specified limits. In such 
cases we utilize the fact that the probability that an error lies within given 
limits is equal to the area under the probability curve between those limits. 
The following proof, while not altogether rigorous, is sufficient to show 
the truth of this statement. 

Going back for a moment to the target experiment of Art. 141, we recall
that in plotting the results we erected ordinates at equal distances apart
along the x-axis. The height of each ordinate was made proportional to
the number of dots falling within the corresponding interval on the target.
If we imagine rectangles constructed with the equal intervals along the
x-axis as bases and the corresponding ordinates as altitudes (see Fig. 42),
we readily see that the area of each rectangle is proportional to the number
of dots falling within the corresponding compartment. Thus, if $N_i$ is the
number of dots in any compartment and $A_i$ is the area of the corresponding
rectangle, we have

(1) $A_i = k_1 N_i$.

Fig. 42

Now if we make one more attempt to hit the target line in the experiment
of Art. 141, the chance of hitting within the central compartment
is about 53/130, that of hitting within the next compartment to the right
is about 31/130, etc. The chance of hitting within some one of these
compartments is therefore

$\frac{1}{130} + \frac{6}{130} + \frac{31}{130} + \frac{53}{130} + \frac{32}{130} + \frac{6}{130} + \frac{1}{130} = 1$.

Since the chance of hitting within any compartment is proportional to
the number of hits made in a large number of shots, we have for any
compartment

(2) $p_i = k_2 N_i$,

where $p_i$ is the probability that a single additional shot will fall in any
compartment in which $N_i$ shots fell in a previous experiment. Eliminating
$N_i$ between equations (1) and (2), we get

(3) $p_i = \frac{k_2}{k_1} A_i$,

which shows that the chance of making a hit in any compartment is
proportional to the area of the corresponding rectangle. The chance of
hitting within some compartment is therefore

(4) $p = 1 = p_1 + p_2 + \cdots = \frac{k_2}{k_1}(A_1 + A_2 + \cdots) = \frac{k_2}{k_1} \sum A$.

Now when the number of shots is increased indefinitely and the width
of each compartment on the target is correspondingly decreased, it is
plain that the bases of the corresponding rectangles will likewise decrease
and that the sum of the areas of these rectangles will approach the area
under the probability curve as a limit. The area under this curve is
always finite, and since it represents the probability that a shot will fall
somewhere, it (the area) represents certainty and therefore may be taken
as 1; or $\lim \sum A = 1$. Hence by (4) we have

$1 = \frac{k_2}{k_1}(1)$, or $k_2 = k_1$.

Equation (3) now becomes

(5) $p_i = A_i$,

which shows that the chance of making a hit in any compartment is equal
to the area of the corresponding rectangle.

From equation (5) we have the important result that the chance of
making an error whose magnitude lies between $x$ and $x + \Delta x$ is*

(6) $p = y\,\Delta x$,

where $y$ is the ordinate to the probability curve. The chance of making
an error whose magnitude is between $x_1$ and $x_2$ is therefore

(7) $p = \lim_{\Delta x \to 0} \sum y\,\Delta x = \int_{x_1}^{x_2} y\,dx$.
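
The argument can be traced numerically on the data of Art. 141. The short Python sketch below treats each compartment as a rectangle of unit width whose height is scaled so that the total area is 1; the names `height` and `chance_between` are illustrative, not from the text.

```python
# Dots per compartment, from the table of Art. 141.
dots = {3: 1, 2: 6, 1: 31, 0: 53, -1: 32, -2: 6, -3: 1}
total = sum(dots.values())          # 130 shots in all
width = 1.0                         # each compartment is one ruling wide

# Height of each rectangle, scaled so that the total area is 1.
height = {k: n / (total * width) for k, n in dots.items()}

def chance_between(lo, hi):
    """Chance of a hit in compartments lo..hi = sum of the rectangle areas."""
    return sum(height[k] * width for k in range(lo, hi + 1))

print(chance_between(-3, 3))   # the whole target: certainty
print(chance_between(-1, 1))   # the three central compartments, 116/130
```

As the compartments are made narrower, the sum in `chance_between` plays the part of the integral in (7).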

143. The Probability Equation. To derive the equation of the Proba- 
bility Curve we make use of the following facts as to the distribution of 
accidental errors, as indicated by the table of Art. 141 and the corre- 
sponding curve: 

1. Small errors are more frequent than large ones, which shows that the 
probability of an error depends upon its size. 

2. Positive and negative errors of the same size are about equal in 
number, thus making the probability curve symmetrical about the y axis. 

3. Very large accidental errors do not occur. 

These three fundamental facts are so self-evident that they may be taken
as axioms.

From axioms 1 and 2 it is plain that the ordinate to the probability
curve must be a function of the square of the abscissa, or

$y = f(x^2)$.

Here the function $f(x^2)$ is called the error function. Our problem now
is to determine the form of this function.

Referring once more to the target experiment, we can readily see that
if we had aimed at a particular point on the target line the distribution
of shots with respect to the line would not have been different from that
found in this experiment. Suppose, then, that we try another experiment
of this kind and aim at some point O in the plane of the paper. The
shots will be distributed about O in such a manner that if we draw any
line through O the probability that any shot hits at a distance $\epsilon$ from
this line will be

$p = f(\epsilon^2)\,d\epsilon$.

* Except for differentials of higher order.

Let us therefore draw through O any two lines at right angles to each
other. We shall take these as axes of coordinates for two variables x and y.
Let us consider any shot that falls at a point P(x, y). The chance that
P lies in a strip of width dx at a distance x from the y-axis is

$p_x = f(x^2)\,dx$,

and the chance that P lies in a strip of width dy at a distance y from
the x-axis is

$p_y = f(y^2)\,dy$.

The chance that P lies in both of these strips and hence in the small
rectangle dx dy is therefore

(1) $p = p_x p_y = f(x^2) f(y^2)\,dx\,dy$.

If we draw any other set of rectangular axes through O, so that the
coordinates of P referred to these axes are x' and y', we evidently have

$p_{x'} = f(x'^2)\,dx'$,

$p_{y'} = f(y'^2)\,dy'$.

Hence the chance that P lies in the rectangle dx'dy' is

(2) $p' = f(x'^2) f(y'^2)\,dx'\,dy'$.

But the chance that this particular shot falls within a small area A
is the same regardless of the orientation of the axes through O. Hence
if we take dx' and dy' such that

$dx'\,dy' = dx\,dy = A$,

we have $p = p'$, or

(3) $f(x^2) f(y^2) = f(x'^2) f(y'^2)$.

Suppose now that the axes OX' and OY' are oriented so that OX'
passes through P. Then

$x' = \sqrt{x^2 + y^2}$, $y' = 0$.

Hence (3) becomes

(4) $f(x^2) f(y^2) = f(x^2 + y^2) f(0) = C f(x^2 + y^2)$,

since $f(0)$ is a constant.*

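Equation (4) is a functional equation and can be solved by first
differentiating and then integrating.

Differentiating (4) partially with respect to $x^2$ and $y^2$ in turn, we have

$f'(x^2) f(y^2) = C\,\frac{\partial f(x^2 + y^2)}{\partial (x^2)}$,

$f(x^2) f'(y^2) = C\,\frac{\partial f(x^2 + y^2)}{\partial (y^2)}$.

Now since $\partial f(u + v)/\partial u = \partial f(u + v)/\partial v$, the right-hand members of these
equations are equal. Hence

$f'(x^2) f(y^2) = f'(y^2) f(x^2)$,

or

$\frac{f'(x^2)}{f(x^2)} = \frac{f'(y^2)}{f(y^2)} = k$,

where $k$ is a constant, since $x$ and $y$ are independent of each other.

Multiplying the equation $f'(x^2)/f(x^2) = k$ through by $d(x^2)$ and
integrating with respect to $x^2$, we have

$\log_e f(x^2) = kx^2 + \log_e c$,

or

(5) $f(x^2) = c\,e^{kx^2}$.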

Now since the probability of an error decreases as the size of the error
increases, it is plain that $k$ must be negative. Putting $k = -h^2$, we have

(6) $f(x^2) = c\,e^{-h^2x^2}$.

Hence

(7) $y = c\,e^{-h^2x^2}$

is the equation of the probability curve.
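
As a quick numerical check that a function of the form (6) does satisfy the functional equation (4), the following Python sketch compares both sides at a few points. The constants `c = 0.7` and `h = 1.3` are arbitrary illustrative values, not taken from the text.

```python
import math

def f(u, c=0.7, h=1.3):
    """Error function f(x^2) = c * exp(-h^2 * x^2), with u standing for x^2."""
    return c * math.exp(-h * h * u)

for x, y in [(0.2, 0.5), (1.0, -0.3), (0.0, 2.0)]:
    left = f(x * x) * f(y * y)
    right = f(0.0) * f(x * x + y * y)      # C = f(0) in equation (4)
    print(x, y, left, right)
```

The two printed columns agree to rounding error for any choice of the constants, since $c\,e^{-h^2x^2} \cdot c\,e^{-h^2y^2} = c \cdot c\,e^{-h^2(x^2+y^2)}$.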

To determine the constant c we utilize the fact that the area under the
probability curve is equal to 1. Hence we have

(8) $1 = \int_{-\infty}^{\infty} c\,e^{-h^2x^2}\,dx = \frac{2c}{h} \int_0^{\infty} e^{-(hx)^2}\,d(hx)$.

This integral must be evaluated by an indirect method. To effect the
evaluation let us consider the volume of the solid of revolution (Fig. 43)
included between the xy-plane and the surface generated by revolving the
curve $z = e^{-x^2}$ about the z-axis. Since this is a surface of revolution, its
equation is

(9) $z = e^{-(x^2 + y^2)}$.

In cylindrical coordinates this equation becomes

(10) $z = e^{-r^2}$, where $x^2 + y^2 = r^2$.


Fig. 43

Taking as the element of volume a cylindrical shell of radius r, thickness
dr, and height z, we have

$dV = 2\pi r \cdot dr \cdot z = 2\pi r e^{-r^2}\,dr$.

(11) $V = 2\pi \int_0^{\infty} e^{-r^2} r\,dr = -\pi \int_0^{\infty} e^{-r^2}(-2r\,dr) = -\pi \left[ e^{-r^2} \right]_0^{\infty} = \pi$.

Using rectangular coordinates, we take as the element of volume a prism
of base dx dy and altitude z. Hence we have from (9)

(12) $V = 4 \int_0^{\infty}\!\int_0^{\infty} z\,dx\,dy = 4 \int_0^{\infty}\!\int_0^{\infty} e^{-(x^2 + y^2)}\,dx\,dy = 4 \int_0^{\infty} e^{-x^2}\,dx \int_0^{\infty} e^{-y^2}\,dy$.

Now since the value of a definite integral depends only on its limits
and not on the variable of integration, we may replace y by x in the
second integral. We then have

(13) $V = 4 \int_0^{\infty} e^{-x^2}\,dx \int_0^{\infty} e^{-x^2}\,dx = \left[ 2 \int_0^{\infty} e^{-x^2}\,dx \right]^2$.

Since we have already found $V = \pi$ above, we have

(14) $\left[ 2 \int_0^{\infty} e^{-x^2}\,dx \right]^2 = \pi$, or $\int_0^{\infty} e^{-x^2}\,dx = \frac{\sqrt{\pi}}{2}$.

Substituting this in (8), we get $c = h/\sqrt{\pi}$. Now putting this value of
c in (7), we have finally

(143.1) $y = \frac{h}{\sqrt{\pi}}\,e^{-h^2x^2}$

as the equation of the probability curve.

Equation (143.1) is of fundamental importance; for it is the foundation
of the Theory of Errors, the Principle of Least Squares, and the Precision
of Measurements. It is known as the Probability Equation, Error Equation,
etc.; and its graph is known as the Normal Probability Curve, the Error
Curve, Gaussian Curve, etc.

It will be observed that this important equation contains only one
arbitrary constant. This constant h is called the "index of precision."
To see the reason for this name we notice that the larger h is the higher
the probability curve will rise in the middle and the more rapidly it will
fall on each side of the "hump." This fact, when considered in connection
with the target problem, means that a large percentage of the shots hit 
near the target and very few hit far from it. In other words, it means 
accurate shooting. 
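
The effect of h can be seen in a few lines of Python. The sketch below evaluates the curve (143.1) for three illustrative values of h (chosen arbitrarily), checks by a simple midpoint rule that the area under each curve is 1, and uses the standard-library function `math.erf` for the chance that an error lies between -x and +x, which for this curve equals erf(hx). The function names are hypothetical.

```python
import math

def curve(x, h):
    """Ordinate of the probability curve y = (h / sqrt(pi)) * exp(-h^2 x^2)."""
    return h / math.sqrt(math.pi) * math.exp(-(h * x) ** 2)

def area(h, lo=-10.0, hi=10.0, n=20_000):
    """Midpoint-rule area under the curve between lo and hi."""
    dx = (hi - lo) / n
    return sum(curve(lo + (i + 0.5) * dx, h) for i in range(n)) * dx

def chance_within(x, h):
    """Chance that an error lies between -x and +x."""
    return math.erf(h * x)

for h in (0.5, 1.0, 2.0):
    # columns: h, height of the hump, total area, chance of |error| < 1
    print(h, round(curve(0.0, h), 4), round(area(h), 6), round(chance_within(1.0, h), 4))
```

The height of the hump and the chance of a small error both grow with h, while the total area stays fixed at 1.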


144. The Law of Error of a Linear Function of Independent Quantities.
We shall next prove a fundamental theorem of great importance, namely:

If $M_1, M_2, \ldots, M_n$ are independent observed quantities whose laws of
error are

$y = \frac{h_1}{\sqrt{\pi}}\,e^{-h_1^2x_1^2}$, $y = \frac{h_2}{\sqrt{\pi}}\,e^{-h_2^2x_2^2}$, $\ldots$, $y = \frac{h_n}{\sqrt{\pi}}\,e^{-h_n^2x_n^2}$,

then any linear function of these quantities obeys a similar law of error.





Proof: Let the linear function be

(1) $F = a_1M_1 + a_2M_2 + \cdots + a_nM_n$,

where $a_1, a_2, \ldots, a_n$ are arbitrary constants. If $x_1, x_2, \ldots, x_n$ denote the
errors of $M_1, M_2, \ldots, M_n$, respectively, and $\xi$ denote the corresponding
error in F, we have

$F + \xi = a_1(M_1 + x_1) + a_2(M_2 + x_2) + \cdots + a_n(M_n + x_n)$

$= a_1M_1 + a_1x_1 + a_2M_2 + a_2x_2 + \cdots + a_nM_n + a_nx_n$.

Subtracting (1),

(2) $\xi = a_1x_1 + a_2x_2 + \cdots + a_nx_n$.

The error $\xi$ in F is thus a linear function of the errors in $M_1$, $M_2$, etc.
We are now to show that the law of error for $\xi$ is the same as the laws
for $x_1$, $x_2$, etc.

To simplify the proof we first take a linear function of two independent
quantities,

$F = a_1M_1 + a_2M_2$.

Then

(3) $\xi = a_1x_1 + a_2x_2$.

Hence

$\xi + \Delta\xi = a_1(x_1 + \Delta x_1) + a_2(x_2 + \Delta x_2)$.

An error of magnitude $x_1$ to $x_1 + \Delta x_1$ in $M_1$ combined with an error of
magnitude $x_2$ to $x_2 + \Delta x_2$ in $M_2$ will therefore produce an error of
magnitude $\xi$ to $\xi + \Delta\xi$ in F.

The probability of the occurrence of an error lying between $x_1$ and
$x_1 + \Delta x_1$ in $M_1$ is

$p_1 = \frac{h_1}{\sqrt{\pi}}\,e^{-h_1^2x_1^2}\,\Delta x_1$,

and similarly the chance of an error lying between $x_2$ and $x_2 + \Delta x_2$ in $M_2$ is

$p_2 = \frac{h_2}{\sqrt{\pi}}\,e^{-h_2^2x_2^2}\,\Delta x_2$.

The probability that these two independent errors will occur simultaneously
and thereby cause an error lying between $\xi$ and $\xi + \Delta\xi$ in F is therefore
the product of their separate probabilities, or

(4) $p = p_1p_2 = \frac{h_1h_2}{\pi}\,e^{-h_1^2x_1^2 - h_2^2x_2^2}\,\Delta x_1\,\Delta x_2$.





This is the probability that any single error in $M_1$ combined with any
single error in $M_2$ will produce a single error in F. But equation (3)
shows that an error in F may be produced by combining any value of $x_2$
(that is, any error in $M_2$) with all possible values of $x_1$ from $-\infty$ to
$+\infty$. Hence the total probability of an error between $\xi$ and $\xi + \Delta\xi$ is
the sum of these mutually exclusive events, or

(5) $\phi(\xi)\,\Delta\xi = \frac{h_1h_2}{\pi}\,\Delta x_2 \int_{-\infty}^{\infty} e^{-h_1^2x_1^2 - h_2^2x_2^2}\,dx_1$,

where $\phi(\xi)$ denotes the error function for $\xi$.

Let us now consider a single definite error $\xi$ in F. This means that
$\xi$ in (3) is to be considered constant for the time being. Hence from
(3) we have

$x_2 = \frac{\xi - a_1x_1}{a_2}$.

Substituting this value of $x_2$ in (5), we get

(6) $\phi(\xi)\,\Delta\xi = \frac{h_1h_2}{\pi}\,\Delta x_2 \int_{-\infty}^{\infty} e^{-h_1^2x_1^2 - h_2^2(\xi - a_1x_1)^2/a_2^2}\,dx_1$.

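To simplify the integration, we transform and simplify the exponent
of e as follows:

For convenience we write

$E = -h_1^2x_1^2 - \frac{h_2^2(\xi - a_1x_1)^2}{a_2^2}$.

Now expand the squared term and reduce the whole right-hand member
to a common denominator. The result is

$E = \frac{-a_2^2h_1^2x_1^2 - h_2^2\xi^2 + 2a_1x_1h_2^2\xi - a_1^2x_1^2h_2^2}{a_2^2}$,

or

$E = -\frac{h_2^2\xi^2}{a_2^2} - \frac{(a_1^2h_2^2 + a_2^2h_1^2)x_1^2}{a_2^2} + \frac{2a_1x_1h_2^2\xi}{a_2^2}$.

Now multiply numerator and denominator of the first fraction on the
right by $a_1^2h_2^2 + a_2^2h_1^2$. We then have

$E = -\frac{(a_1^2h_2^2 + a_2^2h_1^2)h_2^2\xi^2}{a_2^2(a_1^2h_2^2 + a_2^2h_1^2)} - \frac{(a_1^2h_2^2 + a_2^2h_1^2)x_1^2}{a_2^2} + \frac{2a_1x_1h_2^2\xi}{a_2^2}$,

which can be written in the form

$E = -\frac{h_1^2h_2^2\xi^2}{a_1^2h_2^2 + a_2^2h_1^2} - \frac{a_1^2h_2^4\xi^2}{a_2^2(a_1^2h_2^2 + a_2^2h_1^2)} - \frac{(a_1^2h_2^2 + a_2^2h_1^2)x_1^2}{a_2^2} + \frac{2a_1x_1h_2^2\xi}{a_2^2}$

$= -\frac{h_1^2h_2^2\xi^2}{a_1^2h_2^2 + a_2^2h_1^2} - \frac{a_1^2h_2^2 + a_2^2h_1^2}{a_2^2} \left[ x_1^2 - \frac{2a_1h_2^2\xi\,x_1}{a_1^2h_2^2 + a_2^2h_1^2} + \frac{a_1^2h_2^4\xi^2}{(a_1^2h_2^2 + a_2^2h_1^2)^2} \right]$

$= -\frac{h_1^2h_2^2\xi^2}{a_1^2h_2^2 + a_2^2h_1^2} - \frac{a_1^2h_2^2 + a_2^2h_1^2}{a_2^2} \left( x_1 - \frac{a_1h_2^2\xi}{a_1^2h_2^2 + a_2^2h_1^2} \right)^2$.

We now simplify this by putting $C^2 = a_1^2h_2^2 + a_2^2h_1^2$. Then

$E = -\frac{h_1^2h_2^2\xi^2}{C^2} - \frac{C^2}{a_2^2} \left( x_1 - \frac{a_1h_2^2\xi}{C^2} \right)^2$.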

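The integrand in (6) now becomes

$e^{-\frac{h_1^2h_2^2\xi^2}{C^2} - \frac{C^2}{a_2^2}\left(x_1 - \frac{a_1h_2^2\xi}{C^2}\right)^2}$,

which can be written in the form

$e^{-h_1^2h_2^2\xi^2/C^2} \cdot e^{-(C^2/a_2^2)(x_1 - a_1h_2^2\xi/C^2)^2}$.

Since the first factor is independent of $x_1$, it may be removed from under
the integral sign. Then (6) becomes

$\phi(\xi)\,\Delta\xi = \frac{h_1h_2}{\pi}\,\Delta x_2\,e^{-h_1^2h_2^2\xi^2/C^2} \int_{-\infty}^{\infty} e^{-(C^2/a_2^2)(x_1 - a_1h_2^2\xi/C^2)^2}\,dx_1$.

Now put

$u = \frac{C}{a_2} \left( x_1 - \frac{a_1h_2^2\xi}{C^2} \right)$.

Then $du = (C/a_2)\,dx_1$, or $dx_1 = (a_2/C)\,du$, since $\xi$ is constant. Hence

$\phi(\xi)\,\Delta\xi = \frac{h_1h_2}{\pi}\,\Delta x_2\,e^{-h_1^2h_2^2\xi^2/C^2}\,\frac{a_2}{C} \int_{-\infty}^{\infty} e^{-u^2}\,du$.

But

$\int_{-\infty}^{\infty} e^{-u^2}\,du = \sqrt{\pi}$

by Art. 143.

(7) $\therefore\ \phi(\xi)\,\Delta\xi = \frac{a_2h_1h_2}{C\sqrt{\pi}}\,\Delta x_2\,e^{-h_1^2h_2^2\xi^2/C^2}$.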


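We have now taken account of the effect of the errors $x_1$ in $M_1$ in causing
a particular error $\xi$ in F, so that $\xi$ is now a function of $x_2$ alone. Hence
from (3), regarding $x_1$ as a constant, we get $\Delta\xi = a_2\,\Delta x_2$. Substituting
this value for $\Delta\xi$ in (7) and replacing C by its value $\sqrt{a_1^2h_2^2 + a_2^2h_1^2}$,
we get

(8) $\phi(\xi) = \frac{h_1h_2}{\sqrt{\pi}\,\sqrt{a_1^2h_2^2 + a_2^2h_1^2}}\,e^{-\frac{h_1^2h_2^2\xi^2}{a_1^2h_2^2 + a_2^2h_1^2}}$,

or

(9) $\phi(\xi) = \frac{H}{\sqrt{\pi}}\,e^{-H^2\xi^2}$,

where

(10) $H^2 = \frac{h_1^2h_2^2}{a_1^2h_2^2 + a_2^2h_1^2}$.

The law of error for $\xi$, the error in F, is thus of the same form as the
laws of error for $x_1$ and $x_2$, the errors in $M_1$ and $M_2$.

From (10) we have

(11) $\frac{1}{H^2} = \frac{a_1^2h_2^2 + a_2^2h_1^2}{h_1^2h_2^2} = \frac{a_1^2}{h_1^2} + \frac{a_2^2}{h_2^2}$.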

To extend this relation to a linear function of any number of independent
quantities, take

$F = a_1M_1 + a_2M_2 + a_3M_3 = (a_1M_1 + a_2M_2) + a_3M_3$.

If $h_3$ denote the precision index of the errors in $M_3$, and $H_3$ the precision
index for F, then by (11)

$\frac{1}{H_3^2} = \frac{1}{H^2} + \frac{a_3^2}{h_3^2} = \frac{a_1^2}{h_1^2} + \frac{a_2^2}{h_2^2} + \frac{a_3^2}{h_3^2}$.

In the same way, we can extend the formula to a linear function of
4, 5, or any number of quantities. We therefore arrive at the following
result:

If F be a linear function of n independent quantities which have been
determined by observation, the function F follows an error law which is of
the same form as the error laws of the independent unknowns. If the
function is

$F = a_1M_1 + a_2M_2 + a_3M_3 + \cdots + a_nM_n$,

the index of precision, H, of F is given by

(144.1) $\frac{1}{H^2} = \frac{a_1^2}{h_1^2} + \frac{a_2^2}{h_2^2} + \cdots + \frac{a_n^2}{h_n^2}$.
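
This result lends itself to an empirical check. The Python sketch below is an illustration under stated assumptions, not part of the proof: it draws normally distributed errors with the standard library's `random.gauss`, using the fact that an index of precision h corresponds to a standard deviation $1/(h\sqrt{2})$, and compares the precision of the simulated errors of F with the value $1/H^2 = \sum a_i^2/h_i^2$. The coefficients, precision indices, sample size, and function names are all arbitrary choices.

```python
import math
import random

def combined_precision(a, h):
    """Index of precision H of F = sum(a_i * M_i): 1/H^2 = sum(a_i^2 / h_i^2)."""
    return 1.0 / math.sqrt(sum((ai / hi) ** 2 for ai, hi in zip(a, h)))

def sampled_precision(a, h, n=100_000, seed=1):
    """Estimate H from simulated errors of F (normal errors assumed)."""
    rng = random.Random(seed)
    # An index of precision h corresponds to a standard deviation 1/(h*sqrt(2)).
    sig = [1.0 / (hi * math.sqrt(2.0)) for hi in h]
    total = 0.0
    for _ in range(n):
        err = sum(ai * rng.gauss(0.0, si) for ai, si in zip(a, sig))
        total += err * err
    variance = total / n            # the errors have zero mean by construction
    return 1.0 / math.sqrt(2.0 * variance)

a, h = [2.0, -1.0, 0.5], [3.0, 1.5, 4.0]
print(combined_precision(a, h), sampled_precision(a, h))
```

The two printed values should agree to within the sampling scatter, a fraction of a percent for this sample size.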





Even when F is not a linear function of the independent quantities
$M_1, M_2, \ldots, M_n$, the error $\xi$ in F will follow the Normal Law
approximately if the errors $x_1, x_2, \ldots, x_n$ are relatively small. For let

(12) $F = f(M_1, M_2, \ldots, M_n)$

represent any function of $M_1$, $M_2$, etc. Then errors in the M's will cause
an error in F according to the relation

$F + \xi = f(M_1 + x_1, M_2 + x_2, \ldots, M_n + x_n)$.

Expanding the right-hand member by Taylor's theorem, as in Art. 6,
we have

(13) $F + \xi = f(M_1, M_2, \ldots, M_n) + \frac{\partial F}{\partial M_1}x_1 + \frac{\partial F}{\partial M_2}x_2 + \cdots + \frac{\partial F}{\partial M_n}x_n$ + terms in $x_1^2$, $x_1x_2$, etc.

Now if $x_1$, $x_2$, etc. are so small that their squares, products, and higher
powers may be neglected, we have after subtracting (12) from (13)

(14) $\xi = \frac{\partial F}{\partial M_1}x_1 + \frac{\partial F}{\partial M_2}x_2 + \cdots + \frac{\partial F}{\partial M_n}x_n$,

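which is a linear function of $x_1$, $x_2$, etc. Hence by (144.1) we have

(144.2) $\frac{1}{H^2} = \frac{\left(\frac{\partial F}{\partial M_1}\right)^2}{h_1^2} + \frac{\left(\frac{\partial F}{\partial M_2}\right)^2}{h_2^2} + \cdots + \frac{\left(\frac{\partial F}{\partial M_n}\right)^2}{h_n^2}$,

where H denotes the index of precision for the errors $\xi$ in F.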
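
Formula (144.2) is easily mechanized. The Python sketch below is one possible arrangement, not a method given in the text: it estimates the partial derivatives by central differences (an assumption of this sketch; they could equally be supplied analytically) and then sums the terms $(\partial F/\partial M_i)^2/h_i^2$. The function name, the step size, and the rectangle example are hypothetical.

```python
import math

def propagated_precision(f, M, h, step=1e-6):
    """Index of precision H of F = f(M_1, ..., M_n) from formula (144.2).

    The partial derivatives are estimated by central differences.
    """
    total = 0.0
    for i, hi in enumerate(h):
        up = list(M)
        up[i] += step
        dn = list(M)
        dn[i] -= step
        dF = (f(*up) - f(*dn)) / (2.0 * step)   # dF/dM_i at the measured values
        total += (dF / hi) ** 2
    return 1.0 / math.sqrt(total)

# Hypothetical example: area of a rectangle from two measured sides.
H = propagated_precision(lambda m1, m2: m1 * m2, [3.0, 4.0], [10.0, 20.0])
print(H)
```

For the rectangle the partial derivatives are 4 and 3, so $1/H^2 = 16/100 + 9/400$, which the code reproduces.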


145. The Probability Integral and Its Evaluation. To find the probability
that an error of a given series will lie between the limits $x_1$ and
$x_2$ we merely find the area under the probability curve from $x = x_1$ to
$x = x_2$, as shown in Art. 142. This means that we must evaluate the
integral

(1) $P = \frac{h}{\sqrt{\pi}} \int_{x_1}^{x_2} e^{-h^2x^2}\,dx = \frac{1}{\sqrt{\pi}} \int_{hx_1}^{hx_2} e^{-(hx)^2}\,d(hx)$.

The integral

$\int_0^{t} e^{-t^2}\,dt$, where $t = hx$,


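can not be evaluated in finite form, but we can expand the integrand
into a power series and then integrate as many terms as we need. Since

$e^{-t^2} = 1 - t^2 + \frac{t^4}{2!} - \frac{t^6}{3!} + \frac{t^8}{4!} - \cdots$,

we have

$\int_0^t e^{-t^2}\,dt = \int_0^t \left( 1 - t^2 + \frac{t^4}{2!} - \frac{t^6}{3!} + \frac{t^8}{4!} - \cdots \right) dt$.

Hence

(2) $\int_0^t e^{-t^2}\,dt = t - \frac{t^3}{3} + \frac{t^5}{5 \times 2!} - \frac{t^7}{7 \times 3!} + \frac{t^9}{9 \times 4!} - \cdots$.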

This series converges rapidly for small values of t, and the error
committed by stopping at any term is less than the first term omitted
(Art. 12). For example, if $t = \frac{1}{2}$ we have

$\int_0^{1/2} e^{-t^2}\,dt = \frac{1}{2} - \frac{1}{24} + \frac{1}{320} - \frac{1}{5376} + \frac{1}{110592}$

$= 0.5 - 0.04167 + 0.00313 - 0.00019 + 0.00001$

$= 0.46128$.

This result is correct to the last figure, since the error is less than

$\frac{(1/2)^{11}}{11 \times 5!} = 0.00000037$.
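
The computation for $t = \frac{1}{2}$ can be reproduced in a few lines of Python. In this sketch the helper name `series_integral` is hypothetical, and the comparison value is obtained from the standard-library function `math.erf` through the identity $\int_0^t e^{-t^2}\,dt = \frac{\sqrt{\pi}}{2}\,\mathrm{erf}(t)$.

```python
import math

def series_integral(t, n_terms):
    """Sum the first n_terms of the series t - t^3/3 + t^5/(5*2!) - ..."""
    return sum(
        (-1) ** k * t ** (2 * k + 1) / ((2 * k + 1) * math.factorial(k))
        for k in range(n_terms)
    )

approx = series_integral(0.5, 5)                   # the five terms used above
exact = math.sqrt(math.pi) / 2 * math.erf(0.5)     # reference value
bound = 0.5 ** 11 / (11 * math.factorial(5))       # first term omitted
print(round(approx, 5), abs(approx - exact) < bound)
```

The discrepancy from the reference value stays below the first omitted term, as the text asserts.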


For large values of t the series (2) is not convenient for purposes 
of computation, because too many terms are needed to give the desired 
degree of accuracy. We shall therefore derive an expansion in descending 
powers of t, which may be used when t is large. 


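Since

$\int_0^{\infty} e^{-t^2}\,dt = \int_0^{t} e^{-t^2}\,dt + \int_t^{\infty} e^{-t^2}\,dt$,

we have

(3) $\int_0^{t} e^{-t^2}\,dt = \int_0^{\infty} e^{-t^2}\,dt - \int_t^{\infty} e^{-t^2}\,dt$.

The value of the first integral on the right-hand side has already been
found to be $\sqrt{\pi}/2$. Hence (3) becomes

(4) $\int_0^{t} e^{-t^2}\,dt = \frac{\sqrt{\pi}}{2} - \int_t^{\infty} e^{-t^2}\,dt$.

The remaining integral on the right-hand side can be written in the form

$\int_t^{\infty} e^{-t^2}\,dt = -\frac{1}{2} \int_t^{\infty} \frac{1}{t}\,e^{-t^2}(-2t\,dt)$.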





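Integrating this last expression by parts, by putting $u = 1/t$, $dv = e^{-t^2}(-2t\,dt)$,
we get

$\int_t^{\infty} e^{-t^2}\,dt = -\frac{1}{2} \left[ \frac{e^{-t^2}}{t} \right]_t^{\infty} - \frac{1}{2} \int_t^{\infty} \frac{e^{-t^2}}{t^2}\,dt$,

or

$\int_t^{\infty} e^{-t^2}\,dt = \frac{1}{2}\,\frac{e^{-t^2}}{t} + \frac{1}{4} \int_t^{\infty} \frac{1}{t^3}\,e^{-t^2}(-2t\,dt)$.

By continuing this process of integrating by parts and substituting
limits, we get the following expansion:

(5) $\int_t^{\infty} e^{-t^2}\,dt = \frac{e^{-t^2}}{2t} \left[ 1 - \frac{1}{2t^2} + \frac{1 \cdot 3}{(2t^2)^2} - \frac{1 \cdot 3 \cdot 5}{(2t^2)^3} + \cdots \right]$.

Substituting this in (4), we get

(6) $\int_0^{t} e^{-t^2}\,dt = \frac{\sqrt{\pi}}{2} - \frac{e^{-t^2}}{2t} \left( 1 - \frac{1}{2t^2} + \frac{1 \cdot 3}{(2t^2)^2} - \frac{1 \cdot 3 \cdot 5}{(2t^2)^3} + \cdots \right)$.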


This series (6) is called an asymptotic series. It is divergent, but the
terms in parenthesis decrease in numerical value so long as the number
of terms does not exceed $t^2 + 1$. This is the maximum number of terms
ever used in computations with this series. The error committed in using
(6) is less than the last term retained.*

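As an example of the use of (6) we shall compute

$\int_0^{2} e^{-t^2}\,dt$.

We have

$\int_0^{2} e^{-t^2}\,dt = \frac{\sqrt{\pi}}{2} - \frac{e^{-4}}{4} \left( 1 - \frac{1}{8} + \frac{3}{64} - \frac{15}{512} + \frac{105}{4096} \right)$

$= 0.88623 - 0.0045789(1 - 0.125 + 0.046875 - 0.0292969 + 0.0256348)$

$= 0.88623 - 0.0042044 = 0.8820$.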


* See Chauvenet's Spherical and Practical Astronomy, Vol. I, pp. 155-156.




The error committed is less than

$0.0045789 \times 0.025635 = 0.0001174$.

As a matter of fact, the number 0.8820 is correct to its last figure. 
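
The same example can be worked by machine. The Python sketch below sums the first few terms of the asymptotic expansion for $t = 2$ and compares the result with a reference value from the standard-library function `math.erf`; the helper name `asymptotic_integral` is hypothetical.

```python
import math

def asymptotic_integral(t, n_terms):
    """sqrt(pi)/2 - exp(-t^2)/(2t) * (1 - 1/(2t^2) + 1*3/(2t^2)^2 - ...)."""
    term, total = 1.0, 0.0
    for k in range(n_terms):
        total += term
        term *= -(2 * k + 1) / (2.0 * t * t)   # next term of the parenthesis
    return math.sqrt(math.pi) / 2 - math.exp(-t * t) / (2.0 * t) * total

value = asymptotic_integral(2.0, 5)              # five terms, as in the text
exact = math.sqrt(math.pi) / 2 * math.erf(2.0)   # reference value
print(round(value, 4), abs(value - exact))
```

Because the series is divergent, adding many more terms for a fixed t eventually makes the result worse, which is why the number of terms is limited as described above.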

By means of formulas (2) and (6) one could compute a table giving
the value of the probability integral for any value of t. Such tables were
computed long ago, and a table of this kind is given at the end of this
book. This table gives the probability of an error lying between $-t$ and
$+t$, where $t = hx$. Since the probability curve is symmetrical with
respect to the y-axis, the chance that an error lies between $-t$ and $+t$
is twice the chance that it lies between 0 and $+t$. Hence the probability
of such an error is

$P = \frac{2}{\sqrt{\pi}} \int_0^{t} e^{-t^2}\,dt$,

where $t = hx$. The use of the table will be explained in working the
examples in the next article.




146. The Probability of Hitting a Target. Suppose we take a rect- 
angular target and draw through its geometric center two lines at right 
angles to each other and parallel to the sides of the target, as indicated 
in Fig. 44. Suppose, further, that we set up this target in a vertical 







plane at a convenient distance away and shoot at it 100 times with a good 
rifle. If the rifle is accurately aimed at the intersection of the dotted 
lines, the hits will be distributed symmetrically above and below the 
horizontal dotted line and to the right and left of the vertical dotted 
line, just as in the case of the pencil hits described in Art. 141. 

If we take the horizontal line as x-axis, the vertical line as y-axis, and
a line through the intersection of these and perpendicular to the plane
of the target as z-axis, the hits will be distributed on each side of the
vertical line according to the formula

(1) $z = \frac{h_x}{\sqrt{\pi}}\,e^{-h_x^2x^2}$,

and they will be distributed above and below the horizontal line according
to the equation

(2) $z = \frac{h_y}{\sqrt{\pi}}\,e^{-h_y^2y^2}$.

The indices of precision $h_x$ and $h_y$ in the two directions may or may not
be equal.

Before we can apply formulas (1) and (2) to problems in target
practice we must know the values of $h_x$ and $h_y$ for the particular gun at
the given range. The precision of a gun is indicated by its probable
error or its mean error (see Art. 155), and these are determined from
firings at the proving grounds.

If $r$ and $\eta$ denote the probable error and the mean error, respectively,
we have (see Art. 155)

$r = \frac{0.4769}{h}$, $\eta = \frac{0.5642}{h}$.

Hence

(3) $h = \frac{0.4769}{r} = \frac{0.5642}{\eta}$, $hx = \frac{0.4769x}{r} = \frac{0.5642x}{\eta}$.


Note. When using the probability table for the solution of target
problems the student must keep in mind the fact that the argument for
this table is $hx$, where $x$ is the given or allowable error; but since
$hx = 0.4769x/r = 0.5642x/\eta$, it is evident that the proper argument for
entering the table is

(a) $hx = \frac{0.4769x}{r}$

when the probable error of the gun is given, and

(b) $hx = \frac{0.5642x}{\eta}$

when the mean error of the gun is given.

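Example 1. For a certain 3-inch gun at a range of 4000 yards the
probable errors were $r_x = 10.4$ yards and $r_y = 5.8$ yards. Find the
probability of hitting at the first shot a rectangular target 18 ft. high and
30 ft. long.

Solution. The probability that the shot will land in a vertical strip
10 yds. wide is

$p_x = \frac{h_x}{\sqrt{\pi}} \int_{-5}^{5} e^{-h_x^2x^2}\,dx = \frac{2}{\sqrt{\pi}} \int_0^{5h_x} e^{-(h_xx)^2}\,d(h_xx)$;

and the probability that the same shot will land in a horizontal strip
6 yds. wide is

$p_y = \frac{h_y}{\sqrt{\pi}} \int_{-3}^{3} e^{-h_y^2y^2}\,dy = \frac{2}{\sqrt{\pi}} \int_0^{3h_y} e^{-(h_yy)^2}\,d(h_yy)$.

The chance that the shot will land in both of these strips and therefore
hit the target is

$p = p_xp_y = \frac{2}{\sqrt{\pi}} \int_0^{5h_x} e^{-(h_xx)^2}\,d(h_xx) \times \frac{2}{\sqrt{\pi}} \int_0^{3h_y} e^{-(h_yy)^2}\,d(h_yy)$.

But

$h_xx = \frac{0.4769x}{r_x} = \frac{0.4769 \times 5}{10.4} = 0.229$,

by (3), and

$h_yy = \frac{0.4769y}{r_y} = \frac{0.4769 \times 3}{5.8} = 0.247$.

Entering the probability table with these values of $hx$ as arguments,
we find

$P_x = 0.254$, $P_y = 0.273$.

Hence

$P = P_xP_y = 0.254 \times 0.273 = 0.0693$.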

It would therefore require on the average about 1/0.0693 = 15 shots to 
get a single hit. 
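
Where no printed table is at hand, the tabulated probability $\frac{2}{\sqrt{\pi}} \int_0^t e^{-t^2}\,dt$ is available in Python as the standard-library function `math.erf(t)`. The sketch below reworks Example 1 on that basis; the names `PROBABLE_ERROR_FACTOR` and `strip_chance` are hypothetical, and small differences from the table values in the last digit are to be expected.

```python
import math

PROBABLE_ERROR_FACTOR = 0.4769      # probable error r = 0.4769 / h

def strip_chance(half_width, probable_error):
    """Chance that a shot lands within +/- half_width of the aiming line."""
    t = PROBABLE_ERROR_FACTOR * half_width / probable_error   # argument hx
    return math.erf(t)

p_x = strip_chance(5.0, 10.4)       # vertical strip 10 yds. wide
p_y = strip_chance(3.0, 5.8)        # horizontal strip 6 yds. wide
print(round(p_x, 3), round(p_y, 3), round(p_x * p_y, 4))
```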

Example 2. The mean errors for a certain gun at a range of 3000
yards are

$\eta_x = 8.3$ yds., $\eta_y = 4.6$ yds.

If 30 shots are fired at the side of a house 12 yds. wide and 6 yds. high
at a distance of 3000 yards,

(a) How many hits may be expected?

(b) What is the chance of hitting a door 6 ft. $\times$ 3 ft. in the lower
right-hand corner of the side of the house?

Solution. (a) If the gun is accurately aimed at the geometric center
of the side of the house, any shot will be a hit if it passes within 6 yards
of the central vertical line and within 3 yards of the central horizontal
line. Hence we have

$x = 6$ yds., $y = 3$ yds.; and

$h_xx = \frac{0.5642x}{\eta_x} = \frac{0.5642 \times 6}{8.3} = 0.407$,

$h_yy = \frac{0.5642y}{\eta_y} = \frac{0.5642 \times 3}{4.6} = 0.368$.

From the probability table we find

$P_x = 0.435$, $P_y = 0.397$.

The chance of a hit for each shot is therefore

$P = P_xP_y = 0.435 \times 0.397 = 0.173$.

For 30 shots the number of hits would probably be $30 \times 0.173 = 5.2$ or
5, say.

(b) To find the probability that the door would be hit during the
bombardment we assume that the gun is aimed at the geometric center
of the side of the house, as in (a). Then the door will be hit if a shot
strikes within the rectangle bounded by the lines $x = 5$, $x = 6$, $y = -1$,
$y = -3$. The chance of hitting the door at each shot is therefore

$p = p_xp_y = \frac{h_x}{\sqrt{\pi}} \int_5^6 e^{-h_x^2x^2}\,dx \times \frac{h_y}{\sqrt{\pi}} \int_{-3}^{-1} e^{-h_y^2y^2}\,dy$

$= \left[ \frac{1}{\sqrt{\pi}} \int_0^{6h_x} e^{-(h_xx)^2}\,d(h_xx) - \frac{1}{\sqrt{\pi}} \int_0^{5h_x} e^{-(h_xx)^2}\,d(h_xx) \right] \times \left[ \frac{1}{\sqrt{\pi}} \int_0^{3h_y} e^{-(h_yy)^2}\,d(h_yy) - \frac{1}{\sqrt{\pi}} \int_0^{h_y} e^{-(h_yy)^2}\,d(h_yy) \right]$.

Hence the two values of $h_xx$ to be used in the probability table are

$\frac{0.5642}{8.3} \times 6 = 0.407$ and $\frac{0.5642}{8.3} \times 5 = 0.340$,

for which the probabilities are 0.435/2 and 0.369/2, respectively. Therefore

$P_x = \frac{0.435}{2} - \frac{0.369}{2} = \frac{0.066}{2}$.

Likewise, the two values of $h_yy$ are

$\frac{0.5642}{4.6} \times 3 = 0.368$, $\frac{0.5642}{4.6} \times 1 = 0.1226$.

The corresponding probabilities are found from the table to be 0.397/2
and 0.138/2.

Hence $P_y = 0.397/2 - 0.138/2 = 0.259/2$, and we have finally

$P = P_x \times P_y = \frac{0.066 \times 0.259}{4} = 0.0043$.

The door will be hit unless every one of the 30 shots misses it. The
chance that any shot will miss it is $1 - 0.0043 = 0.9957$. The chance
that every one of the 30 shots misses is therefore $(0.9957)^{30} = 0.8787$.
The chance of a hit is therefore $1 - 0.8787 = 0.1213$.

The door would probably be hit once out of every $1/0.0043 = 233$
shots.
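
Both parts of Example 2 can be checked directly. The Python sketch below computes the chance of landing in any rectangle from the standard-library function `math.erf` rather than from the printed table, so the last digits may differ slightly. Note that $0.397/2 - 0.138/2 = 0.259/2$, so the chance of hitting the door at a single shot comes out near 0.0043. The names `half_table` and `rect_chance` are hypothetical.

```python
import math

MEAN_ERROR_FACTOR = 0.5642          # mean error = 0.5642 / h

def half_table(t):
    """Chance of an error between 0 and t (half the tabulated value)."""
    return math.erf(t) / 2.0

def rect_chance(x1, x2, y1, y2, eta_x, eta_y):
    """Chance that one shot lands in the rectangle x1..x2 by y1..y2 (yards)."""
    hx = MEAN_ERROR_FACTOR / eta_x
    hy = MEAN_ERROR_FACTOR / eta_y
    px = half_table(hx * x2) - half_table(hx * x1)
    py = half_table(hy * y2) - half_table(hy * y1)
    return px * py

side = rect_chance(-6, 6, -3, 3, 8.3, 4.6)    # whole side of the house
door = rect_chance(5, 6, -3, -1, 8.3, 4.6)    # door in the lower right corner
print(round(side, 3), round(30 * side, 1))            # chance per shot, hits in 30
print(round(door, 4), round(1 - (1 - door) ** 30, 3)) # door: per shot, in 30 shots
```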

Example 3. Find the number of shots necessary to make the odds 10
to 1 in favor of at least one hit on the side of the house mentioned in
Example 2.

Solution. The house will certainly be hit at least once unless every
shot misses it. The chance that any shot will be a hit was found to be
0.173. The chance that any shot will miss it is therefore $1 - 0.173 = 0.827$.
The chance that every one of n shots will miss it is then
$(0.827)^n$. The chance of at least one hit is therefore

$P = 1 - (0.827)^n$.

Since the odds are to be 10 to 1 in favor of a hit, we have $P = 10/11$.
Hence

$1 - (0.827)^n = \frac{10}{11}$, or $(0.827)^n = \frac{1}{11}$.

$\therefore\ n \log (0.827) = -\log 11$,

or

$n = \frac{-\log 11}{\log 0.827} = \frac{-1.0414}{9.9175 - 10} = \frac{-1.0414}{-0.0825} = 12.6 = 13$, say.
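
The same calculation in Python, for any single-shot chance and any odds, might be sketched as follows; the function name `shots_needed` and its parameters are hypothetical.

```python
import math

def shots_needed(p_hit, odds_for, odds_against=1):
    """Smallest n for which the chance of at least one hit reaches the odds."""
    target = odds_for / (odds_for + odds_against)        # e.g. 10/11
    n = math.log(1.0 - target) / math.log(1.0 - p_hit)   # solves (1-p)^n = 1-target
    return math.ceil(n)

print(shots_needed(0.173, 10))
```

Natural logarithms are used here instead of common logarithms; the ratio of the two logarithms is the same in either base.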




147. The Principle of Least Squares. Suppose we make a set of n
measurements $m_1, m_2, \ldots, m_n$ of some object or quantity in an effort to
determine as nearly as possible its true magnitude, using the same care, 
methods, and instruments in making each measurement. If we try to 
read the measuring instrument to the finest subdivision of its graduated 
scale and even estimate fractions of a subdivision, we shall find that the 
results of the several measurements do not agree exactly among themselves, 
however much care we may use ; for each measurement is subject to unavoid- 
able accidental errors. How, then, shall we decide upon the best result 
obtainable from any given set of measurements or observations? 

This question is answered by the Principle of Least Squares, which 
says that the best or most probable value of the measured quantity is 
that value for which the sum of the squares of the errors is least. This 
answer is in accord with reason and common sense; for, since the acci- 
dental errors are real quantities their squares are positive quantities and 
the requirement that the sum of these positive quantities shall be as small 
as possible insures that the errors themselves shall be as small numerically 
as possible. 

Furthermore, the requirement that the sum of the squares of the errors 
shall be a minimum leads to the result that the arithmetic mean or 
average of the measurements is the best value obtainable from any set 
of equally trustworthy direct measurements. This result is also in accord 
with experience and common sense. 

The principle of least squares also follows from the Normal Law of 
accidental errors, as we shall now show. 

If we make a set of measurements all with equal care and use the same 
methods and instruments for each, the precision constant h of the prob- 
ability equation will be the same for all the measurements and the fre- 
quency of the accidental errors will be given by the same probability curve. 
If the accidental errors of the n measurements $m_1, m_2, \ldots, m_n$ be denoted
by $x_1, x_2, \ldots, x_n$, respectively, then the respective probabilities of these
errors are

$p_1 = \frac{h}{\sqrt{\pi}}\,e^{-h^2x_1^2}\,dx_1$, $p_2 = \frac{h}{\sqrt{\pi}}\,e^{-h^2x_2^2}\,dx_2$, $\ldots$, $p_n = \frac{h}{\sqrt{\pi}}\,e^{-h^2x_n^2}\,dx_n$.

Since the separate measurements are independent events, the probability
that the set of errors $x_1, x_2, \ldots, x_n$ will be made is the product of their
separate probabilities, or

(1) $P = p_1p_2 \cdots p_n = \left( \frac{h}{\sqrt{\pi}} \right)^n e^{-h^2(x_1^2 + x_2^2 + \cdots + x_n^2)}\,dx_1\,dx_2 \cdots dx_n$.





Now since small errors occur more frequently than large ones, a set
of small errors is a more probable event than a set of large ones in
making any set of measurements. Hence the set which has the greatest
probability will give us the best or most probable value of the quantity
measured; and since the differentials $dx_1$, $dx_2$, etc. are perfectly arbitrary
quantities (the smallest subdivisions of a graduated scale, for instance)
it is evident from equation (1) that this probability P is greatest when
the exponent of e is least, that is, when

$x_1^2 + x_2^2 + \cdots + x_n^2 = \sum x^2$ is a minimum.

Thus, by the principles of probability we arrive at the Principle of
Least Squares, namely:

The best or most probable value obtainable from a set of measurements 
or observations of equal precision is that value for which the sum of the 
squares of the errors is a minimum. 
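
For direct measurements of equal precision the principle singles out the arithmetic mean, as stated earlier in this article. The Python sketch below illustrates this on a small set of invented readings (the numbers are hypothetical, not data from the text): the sum of the squares of the deviations is smaller at the mean than at nearby trial values.

```python
def sum_of_squares(value, measurements):
    """Sum of the squares of the deviations of the measurements from value."""
    return sum((m - value) ** 2 for m in measurements)

measurements = [10.12, 10.09, 10.15, 10.11, 10.08]   # hypothetical readings
mean = sum(measurements) / len(measurements)

for trial in (mean - 0.02, mean, mean + 0.02):
    print(round(trial, 3), round(sum_of_squares(trial, measurements), 6))
```

Moving the trial value a distance d away from the mean raises the sum by $n d^2$, whatever the readings may be.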

Note. Any measurable quantity has a definite, true magnitude; and 
the differences between this unknown magnitude and the several measure- 
ments made to determine it are the true errors of those measurements. 
However, when these errors are required to satisfy the condition that the 
sum of their squares shall be a minimum, for the purpose of arriving at 
the most probable magnitude of the quantity, they become residual errors, 
or simply residuals (see Art. 149). But it is shown in Art. 151 that the
sum of the squares of the residuals is least when the sum of the squares
of the errors is least.

148. Weighted Observations. If the measurements are not of equal
precision, the values of h will be different. The probabilities of the errors
will then be

$p_1 = \frac{h_1}{\sqrt{\pi}}\,e^{-h_1^2x_1^2}\,dx_1$, $p_2 = \frac{h_2}{\sqrt{\pi}}\,e^{-h_2^2x_2^2}\,dx_2$, $\ldots$, $p_n = \frac{h_n}{\sqrt{\pi}}\,e^{-h_n^2x_n^2}\,dx_n$;

and the probability of their simultaneous occurrence will be

(1) $P = p_1p_2 \cdots p_n = \frac{h_1h_2 \cdots h_n}{(\sqrt{\pi})^n}\,e^{-(h_1^2x_1^2 + h_2^2x_2^2 + \cdots + h_n^2x_n^2)}\,dx_1\,dx_2 \cdots dx_n$.

The best value obtainable from this set of measurements will therefore
be that for which

(2) $\sum h^2x^2 = h_1^2x_1^2 + h_2^2x_2^2 + \cdots + h_n^2x_n^2$ is a minimum.

Since it is not customary in practice to make such an expression as
(2) a minimum, it is necessary to introduce here the idea of weighted
measurements or observations. By the weight of an observation is meant
its relative value or importance when compared with other observations
of a set. Thus, if we measure a line three times with the same care and
accuracy, we regard the mean of the three measurements as more reliable
than any one of the single measurements. We express this by saying
that the weight of the mean is three times that of a single measurement.
An observation of weight w is therefore one which is equivalent in
importance to w observations of unit weight.

To find the relation between weight and precision index let

$h$ = precision index corresponding to weight 1,
$h_1$ = precision index corresponding to weight $w_1$.

Then the probability of an error of magnitude x in the observations of
unit weight is given by

$p = \frac{h}{\sqrt{\pi}}\, e^{-h^2 x^2}\, dx ,$

and the probability of an error of the same magnitude in a set of
observations of weight $w_1$ is

$p_1 = \frac{h_1}{\sqrt{\pi}}\, e^{-h_1^2 x^2}\, dx .$

The probability of the same error (of magnitude x) in $w_1$ observations
of unit weight is

$P = p\,p\,p \cdots$ to $w_1$ factors $= \left(\frac{h}{\sqrt{\pi}}\right)^{w_1} e^{-w_1 h^2 x^2}\, (dx)^{w_1} .$

Now if the weighted observation (wt. $w_1$) is to be worth as much as
the $w_1$ observations of unit weight, an error of magnitude x must have
the same probability in it as in the case of the $w_1$ observations. Hence
we must have $p_1 = P$, or

$\frac{h_1}{\sqrt{\pi}}\, e^{-h_1^2 x^2}\, dx = \left(\frac{h}{\sqrt{\pi}}\right)^{w_1} e^{-w_1 h^2 x^2}\, (dx)^{w_1}$

for any x. Taking logarithms to the base e, we have

$\log_e \frac{h_1}{\sqrt{\pi}} - h_1^2 x^2 = w_1 \log_e \frac{h}{\sqrt{\pi}} - w_1 h^2 x^2 + (w_1 - 1)\log_e dx .$



Equating coefficients of like powers of x,

$h_1^2 = w_1 h^2$, or $w_1 = \frac{h_1^2}{h^2}$.

Likewise, for observations of weights $w_2$, $w_3$, etc., we have

$h_2^2 = w_2 h^2$, or $w_2 = \frac{h_2^2}{h^2}$;

$h_3^2 = w_3 h^2$, or $w_3 = \frac{h_3^2}{h^2}$;

etc.

The weights are therefore proportional to the squares of the precision 
indices. 

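Substituting in (1) the values of $h_1^2$, $h_2^2$, etc. as given above, we get

$P = \frac{h^n \sqrt{w_1 w_2 \cdots w_n}}{(\sqrt{\pi})^n}\, e^{-h^2 (w_1 x_1^2 + w_2 x_2^2 + \cdots + w_n x_n^2)}\, dx_1\, dx_2 \cdots dx_n .$

In order that P be a maximum we must have

(3)  $\sum w x^2 = w_1 x_1^2 + w_2 x_2^2 + \cdots + w_n x_n^2$ a minimum.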

We can now state the Principle of Least Squares in its most general 
form : 

The best value of an unknown quantity that can be obtained from a set 
of measurements of unequal precision is that which makes the sum of the 
weighted squares of the errors a minimum. 

149. Residuals. In the preceding articles of the present chapter we 
have been discussing the errors of observations and measurements. The 
true or exact magnitude of a quantity can not be found by measurement; 
for the unit of measurement and the quantity to be measured are, in 
general, incommensurable. Moreover, all measurements are subject to 
errors of some kind. It is obvious, therefore, that the error of a measurement
can never be determined, the error being defined as the true value
minus the measured value. What we actually do, and all we can do, is to
measure the quantity as many times as may be desirable or convenient and
then find from these measurements the most probable value of the measured
quantity. The difference between the most probable value and any particular
measurement is called the residual for that measurement. For consistency
in sign we always write

Error = True Value — Measured Value. 

Residual = Most Probable Value — Measured Value. 



Let $m_0$ denote the most probable value of a measured quantity and let
$m_1, m_2, \cdots m_n$ denote the values of n separate measurements. Then if
$v_1, v_2, \cdots v_n$ denote the residuals of these measurements, we have by
definition

$v_1 = m_0 - m_1$,
$v_2 = m_0 - m_2$,
$\cdots$
$v_n = m_0 - m_n$.


150. The Most Probable Value of a Set of Direct Measurements. The 
definition of residuals leads us up to the problem of finding the most 
probable value of a set of measurements. Suppose we make n direct 
measurements on some unknown magnitude, how shall we determine the 
best value of the magnitude, on the basis of the n measurements? To 
give a general answer to this question we shall first assume that the 
measurements are of unequal weight. 

Let $m_1, m_2, \cdots m_n$ denote the n measurements and let $w_1, w_2, \cdots w_n$
denote their respective weights. Then if m denote the true value of
the unknown magnitude, the errors of the several measurements are

$x_1 = m - m_1, \quad x_2 = m - m_2, \quad \cdots \quad x_n = m - m_n .$

Now the true value m is unknown and can not be found, but we must 
adopt some value for it. The principle of least squares says that the 
best value is that which makes the sum of the weighted squares of the 
errors a minimum (Art. 148) ; that is. 


(1)  $f(m) = w_1 (m - m_1)^2 + w_2 (m - m_2)^2 + \cdots + w_n (m - m_n)^2$


must be a minimum. 

Differentiating (1) with respect to m, putting the derivative equal to 
zero, and replacing m by mo, which is to be the adopted value of m, 
we have 

$w_1 (m_0 - m_1) + w_2 (m_0 - m_2) + \cdots + w_n (m_0 - m_n) = 0 ,$

from which

(150.1)  $m_0 = \frac{w_1 m_1 + w_2 m_2 + \cdots + w_n m_n}{w_1 + w_2 + \cdots + w_n} = \frac{\sum w m}{\sum w} .$

This value $m_0$ is called the weighted mean of the several measurements.

If all the measurements are of equal weight, then $w_1 = w_2 = \cdots = w_n$,
and (150.1) reduces to

(150.2)  $m_0 = \frac{m_1 + m_2 + \cdots + m_n}{n} ,$

which is simply the average of all the measurements. This result is in 
accord with experience and common sense. 

Formulas (150.1) and (150.2) enable us to prove the following im- 
portant theorem : 

In any set of measurements of equal weight the algebraic sum of the 
residuals is zero, and in a set of measurements of unequal weight the 
algebraic sum of the weighted residuals is zero. 

To prove this theorem let $m_0$ denote the most probable value of the
n measurements $m_1, m_2, \cdots m_n$; and let $v_1, v_2, \cdots v_n$ denote the residuals.
Then

$v_1 = m_0 - m_1$,
$v_2 = m_0 - m_2$,
$\cdots$
$v_n = m_0 - m_n$.

Adding these n equations, we get

$v_1 + v_2 + \cdots + v_n = n m_0 - (m_1 + m_2 + \cdots + m_n)$
$= n m_0 - n m_0 = 0$, by (150.2).

To prove the second part of the theorem let $w_1, w_2, \cdots w_n$ denote the
weights of the several measurements. The weighted residuals are

$w_1 v_1 = w_1 m_0 - w_1 m_1$,
$w_2 v_2 = w_2 m_0 - w_2 m_2$,
$\cdots$
$w_n v_n = w_n m_0 - w_n m_n$.

Adding these n equations, as before, we get

$w_1 v_1 + w_2 v_2 + \cdots + w_n v_n = m_0 (w_1 + w_2 + \cdots + w_n) - (w_1 m_1 + w_2 m_2 + \cdots + w_n m_n)$
$= 0$, by (150.1).

This theorem provides us with a valuable check on the computed 
residuals in any set of measurements. However, since the residuals in
such cases are rounded numbers their algebraic sum will rarely be exactly
zero.
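The weighted mean (150.1) and the check on the residuals can be illustrated numerically. The following is only a sketch: the readings and weights are hypothetical values chosen for illustration, and the function name `weighted_mean` is our own.

```python
from math import fsum

def weighted_mean(measurements, weights):
    """Weighted mean, formula (150.1): sum(w*m) / sum(w)."""
    return fsum(w * m for m, w in zip(measurements, weights)) / fsum(weights)

# Hypothetical readings and weights, chosen only for illustration.
readings = [10.2, 10.4, 10.1, 10.3]
weights = [1, 3, 2, 4]

m0 = weighted_mean(readings, weights)

# Residual = most probable value minus measured value (Art. 149).
residuals = [m0 - m for m in readings]

# The algebraic sum of the weighted residuals should vanish,
# apart from floating-point rounding.
weighted_residual_sum = fsum(w * v for w, v in zip(weights, residuals))

# With equal weights (150.1) reduces to the plain average (150.2).
plain_mean = weighted_mean(readings, [1] * len(readings))

print(m0, weighted_residual_sum, plain_mean)
```

In hand computation the residuals are rounded, so the sum will generally differ from zero by a few units in the last place retained.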





151. Law of Error for Residuals. We shall now show that when the
errors of a set of measurements follow the Normal Law of error, the
residuals likewise follow a similar law. To prove this let $m$ denote the
true value of the measured quantity; $m_0$ the most probable value;
$\epsilon_1, \epsilon_2, \cdots \epsilon_n$ the errors of measurement; $v_1, v_2, \cdots v_n$ the residuals;
and $w_1, w_2, \cdots w_n$ the weights. Then

(v)                          ($\epsilon$)
$v_1 = m_0 - m_1$,           $\epsilon_1 = m - m_1$,
$v_2 = m_0 - m_2$,           $\epsilon_2 = m - m_2$,
$\cdots$                     $\cdots$
$v_n = m_0 - m_n$.           $\epsilon_n = m - m_n$.

For the case of measurements of equal weight we have from column (v)

$\sum v = n m_0 - \sum m = 0$, or $m_0 = \frac{\sum m}{n}$;

and from column ($\epsilon$) we get in a similar manner

$\sum \epsilon = n m - \sum m$, or $\sum m = n m - \sum \epsilon$.

Substituting this value of $\sum m$ in the equation $m_0 = \sum m / n$, we get

(1)  $m_0 = \frac{n m - \sum \epsilon}{n} = m - \frac{1}{n} \sum \epsilon .$

Now substituting this value of $m_0$ in the equations of column (v), we have

(2)  $v_1 = m - \frac{1}{n}\sum\epsilon - m_1 = m - m_1 - \frac{1}{n}\sum\epsilon = \epsilon_1 - \frac{1}{n}\sum\epsilon$
     $= \epsilon_1 - \frac{1}{n}\epsilon_1 - \frac{1}{n}\epsilon_2 - \cdots - \frac{1}{n}\epsilon_n ,$

or

$v_1 = \left(\frac{n-1}{n}\right)\epsilon_1 - \frac{1}{n}\epsilon_2 - \cdots - \frac{1}{n}\epsilon_n .$

Similarly,

$v_2 = -\frac{1}{n}\epsilon_1 + \left(\frac{n-1}{n}\right)\epsilon_2 - \cdots - \frac{1}{n}\epsilon_n ,$

etc.

We have thus proved that the residuals are linear functions of the errors.
Hence by Art. 144 they follow the Normal Law.



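If h is the precision index for the $\epsilon$'s and H that for the v's, we have
from (144.1)

$\frac{1}{H^2} = \frac{\left(\frac{n-1}{n}\right)^2}{h^2} + \frac{1}{n^2 h^2} + \cdots + \frac{1}{n^2 h^2}$  (with $n - 1$ terms of the second kind).

Hence

(3)  $\frac{1}{H^2} = \frac{(n-1)^2 + (n-1)}{n^2 h^2} = \frac{n-1}{n h^2}$, or $H = h \sqrt{\frac{n}{n-1}}$.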

Since the residuals follow the Normal Law, the probability equation 
for them is 

(4)  $y = \frac{H}{\sqrt{\pi}}\, e^{-H^2 v^2} .$

From (3) it is plain that the precision index for the residuals is a 
function of both h and n, and that it is always larger than h. This 
means that the graph of (4) rises higher in the middle and falls off 
more rapidly on each side than does the graph of (143.1). As the 
number of measurements increases, the graph of (4) approaches that of 
(143.1) more and more closely, and would ultimately coincide with it if 
the number of measurements were increased indefinitely. 

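When the measurements are of unequal weight, the weighted residuals
and weighted errors are as given in the columns (wv) and (w$\epsilon$) below.

(wv)                                 (w$\epsilon$)
$w_1 v_1 = w_1 m_0 - w_1 m_1$,       $w_1 \epsilon_1 = w_1 m - w_1 m_1$,
$w_2 v_2 = w_2 m_0 - w_2 m_2$,       $w_2 \epsilon_2 = w_2 m - w_2 m_2$,
$\cdots$                             $\cdots$
$w_n v_n = w_n m_0 - w_n m_n$.       $w_n \epsilon_n = w_n m - w_n m_n$.

On adding the equations (wv) we get

$\sum w v = m_0 \sum w - \sum w m = 0$, by (150.1).

$\therefore \; m_0 = \frac{\sum w m}{\sum w} .$

By adding the equations in column (w$\epsilon$) we obtain

$\sum w \epsilon = m \sum w - \sum w m$, or $\sum w m = m \sum w - \sum w \epsilon$.

Substituting this value of $\sum w m$ in the expression for $m_0$ above, we get

(5)  $m_0 = \frac{m \sum w - \sum w \epsilon}{\sum w} = m - \frac{\sum w \epsilon}{\sum w} .$

Hence

(6)  $v_1 = m - \frac{\sum w \epsilon}{\sum w} - m_1 = \epsilon_1 - \frac{\sum w \epsilon}{\sum w} ,$

or

$v_1 = \left(\frac{\sum w - w_1}{\sum w}\right)\epsilon_1 - \frac{w_2}{\sum w}\epsilon_2 - \cdots - \frac{w_n}{\sum w}\epsilon_n .$

Similarly,

$v_2 = -\frac{w_1}{\sum w}\epsilon_1 + \left(\frac{\sum w - w_2}{\sum w}\right)\epsilon_2 - \cdots - \frac{w_n}{\sum w}\epsilon_n ,$

etc.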

Hence in the case of measurements of unequal weight the residuals are 
linear functions of the errors and therefore follow the Normal Law. The 
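residual $v_1$, for example, would follow the law

$y = \frac{H_1}{\sqrt{\pi}}\, e^{-H_1^2 v_1^2} ,$

where

$\frac{1}{H_1^2} = \frac{1}{(\sum w)^2}\left[\frac{(\sum w - w_1)^2}{h_1^2} + \frac{w_2^2}{h_2^2} + \cdots + \frac{w_n^2}{h_n^2}\right] .$

And similarly for the other residuals.

On squaring and adding the n equations $v_1 = \epsilon_1 - (1/n)\sum\epsilon$,
$v_2 = \epsilon_2 - (1/n)\sum\epsilon$, etc., we obtain

(7)  $\sum v^2 = \sum \epsilon^2 - \frac{1}{n}\left(\sum \epsilon\right)^2 ,$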

which gives the relation between the sum of the squares of the residuals 
and the sum of the squares of the true errors in any set of measurements 
of equal weight. Since both terms in the right member of (7) are 
positive quantities, it is evident that the sum of the squares of the residuals 
is always less than the sum of the squares of the errors, but that the 
difference is very slight. 
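The relation between the sum of the squares of the residuals and the sum of the squares of the errors, namely that the former equals the latter diminished by 1/n times the square of the sum of the errors, can be verified numerically. In the sketch below the "true value" is a hypothetical number assumed only for the purpose of the check, since in practice it is unknown; the readings are likewise invented.

```python
from math import fsum

# Hypothetical data: in practice the true value is never known.
true_value = 50.0
readings = [50.3, 49.8, 50.1, 49.6, 50.4]

n = len(readings)
m0 = fsum(readings) / n                      # arithmetic mean (150.2)

errors = [true_value - m for m in readings]  # true errors
residuals = [m0 - m for m in readings]       # residuals

sum_v2 = fsum(v * v for v in residuals)
sum_e2 = fsum(e * e for e in errors)

# Right-hand member: sum of squared errors minus (1/n) * (sum of errors)^2.
rhs = sum_e2 - fsum(errors) ** 2 / n

print(sum_v2, rhs, sum_e2)
```

The two computed quantities agree to within rounding, and the sum of squared residuals does not exceed the sum of squared errors.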

Inasmuch as the quantity $\sum\epsilon$ is very nearly zero in any set of
measurements, the square of this quantity is still smaller and $(1/n)(\sum\epsilon)^2$ is
practically negligible in comparison with $\sum\epsilon^2$. Hence any small shift
in the values of the $\epsilon$'s would have very little effect on the already
negligible quantity $(1/n)(\sum\epsilon)^2$. We may therefore consider this quantity
constant for small changes in the $\epsilon$'s, and then it is plain that $\sum v^2$ is
least when $\sum\epsilon^2$ is least.

This can also be shown in a different way. From equation (7) we have

$\sum v^2 = \sum\epsilon^2 - \frac{1}{n}\left(\epsilon_1^2 + \epsilon_2^2 + \cdots + \epsilon_n^2 + 2\epsilon_1\epsilon_2 + 2\epsilon_1\epsilon_3 + \cdots\right) .$

Now when the number of measurements is large, the product terms $2\epsilon_1\epsilon_2$,
$2\epsilon_1\epsilon_3$, etc. will be about half positive and half negative; and they will
average about the same size. Hence they will cancel one another for the
most part and (7) then reduces to

$\sum v^2 = \sum\epsilon^2 - \frac{1}{n}\sum\epsilon^2 = \frac{n-1}{n}\sum\epsilon^2 .$



From the foregoing considerations we are justified in asserting that 

The sum of the squares of the residuals is a minimum when the sum of 
the squares of the true errors is a minimum, and conversely. 

In a similar manner, on squaring the n equations $v_1 = \epsilon_1 - \sum w\epsilon / \sum w$,
$v_2 = \epsilon_2 - \sum w\epsilon / \sum w$, etc., then multiplying the squared equations by the
corresponding weights $w_1$, $w_2$, etc. and adding the results, we get

(8)  $\sum w v^2 = \sum w \epsilon^2 - \frac{1}{\sum w}\left(\sum w \epsilon\right)^2 .$

Here, again, we see that the sum of the weighted squares of the
residuals is a minimum when the sum of the weighted squares of the true
errors is a minimum, and conversely, since the negligible quantity
$(1/\sum w)(\sum w\epsilon)^2$ may be considered constant for small changes in the $\epsilon$'s.

Remarks. Equation (1) shows that the arithmetic mean is equal to
the true value of the quantity minus a very small quantity; for since the
errors are as likely to be positive as negative the quantity $\sum\epsilon$ is not large,
and $(1/n)\sum\epsilon$ is still smaller. Hence the larger the number of measurements
the nearer does $m_0$ approach the true value of the quantity measured.
Equation (5) shows a similar result in the case of weighted measurements.

Equations (2) and (6) show that any residual is equal to the corresponding
error minus a very small quantity. Therefore when the number
of measurements is large the residuals are practically equal to the true
errors. Hence, although we can never determine the true magnitude of a
measured quantity we can determine it as closely as we please by taking
enough measurements.

152. Agreement between Theory and Experience. At the beginning
of this chapter we described an experiment which was designed to show 
the behavior and distribution of accidental errors. In deriving the Prob- 
ability Equation we made the assumptions that the probability of an error 
depended upon its size and that positive and negative errors of the same 
size were equally likely. These two assumptions were supported by the 
pencil experiment. The first is based upon experience, but the second is 
evident on purely a priori grounds and also supported by experience. No 
rigorous deduction of the Normal Law, based upon purely a priori con- 
siderations, has ever been given. The truth is that, for the kinds of errors 
considered in this book (errors of measurement and observation), the 
Normal Law is proved by experience. Several substitutes for this law 
have been proposed, but none fits the facts so well as it does. 

To show how well the Normal Law agrees with experience when the 
number of measurements is large, we give in the table below the results 
of 470 observations made by Bradley on the right ascensions of the stars 
Sirius and Altair. 


Size of errors       Number computed      Number actually
                     from theory          found

0".0 to 0".1              95                   94
0".1 to 0".2              89                   88
0".2 to 0".3              78                   78
0".3 to 0".4              64                   58
0".4 to 0".5              50                   51
0".5 to 0".6              36                   36
0".6 to 0".7              24                   26
0".7 to 0".8              15                   14
0".8 to 0".9               9                   10
0".9 to 1".0               5                    7
over 1".0                  5                    8


It will be seen that the agreement between theory and experience is 
remarkably close, with the exception of the number of errors of magnitude 
from 0".3 to 0".4. 







EXERCISES XVI

1. Compute the value of the integral to seven decimal 

places. 

2. Compute the value of correct to four decimal places. 

3. Find the probability of hitting at the first shot a rectangular target
60 feet wide and 24 feet high at a distance of 4000 yards, the mean errors
for the gun at this range being

$\eta_x = 7.4$ yds., $\eta_y = 5.2$ yds.

4. If 10 shots are fired at a cylindrical standpipe 120 feet high and 40
feet in diameter at a distance of three miles, find the chance that the
standpipe will be hit if the probable errors of the gun for this range are

$r_x = 14.2$ feet, $r_y = 10.6$ feet.

5. If the foretop of a battleship is a cylinder 12 feet in diameter and
8 feet high, find the chance that it will be hit by a shot aimed at a point
80 feet directly below, the mean errors for the gun in this case being

$\eta_x = 42.6$ feet, $\eta_y = 36.5$ feet.

About how many shots would have to be fired at the ship (aimed at a 
point 80 feet below the foretop) before the foretop would be hit? 

6. Twelve measurements of the length of a line are given below. Find 
the most probable length of the line. 


364.2    364.2    364.3    364.4
363.7    363.8    363.9    364.1
364.3    364.3    364.5    364.0


7. Seven measurements of an object by different methods are given in
the following table. If the weights of the different measurements are as
given in the table, find the most probable size of the object.

Measurements     Weights

   369.2            2
   368.3            1
   371.1            3
   370.2            5
   369.1            2
   370.6            4
   372.2            1


Compute the residuals and weighted residuals. Find the algebraic sum 
of the weighted residuals and the sum of the weighted squares of the 
residuals. 






CHAPTER XVII 


THE PRECISION OF MEASUREMENTS 

153. Measurements, Direct and Indirect. Direct measurements are 
those made by methods and instruments whose indications give directly 
the quantity sought. Such measurements are usually made by reading 
a scale graduated in terms of the chosen unit. Yard sticks, clocks, 
voltmeters, chemical balances, etc. are instruments for making direct 
measurements. 

Indirect measurements are those in which the quantity measured is not
given directly by observation or readings taken, but must be calculated
from them. Thus, in an indirect measurement the quantity sought is a
function of one or more directly measured quantities. For example, if we
measure two sides and the included angle of a plane triangle we can find
the remaining side and the area by means of the formulas

$a = \sqrt{b^2 + c^2 - 2bc\cos A}$,   Area $= \tfrac{1}{2}\, bc \sin A$.

Here the directly measured quantities are b, c, A, and the indirectly
measured (computed) ones are a and the area.

The relation between observed and computed quantities may be expressed
by the general formula

$y = f(x_1, x_2, \cdots, a, b, c, \cdots)$,

where y and the x's represent observed or computed quantities and a, b, c,
etc. represent numerical constants.

154. Precision and Accuracy. The words "precision" and "accuracy,"
when used in the discussion of measurements, have quite different meanings. 
Precision has to do with accidental errors, and a precise measurement 
would be one free from accidental errors. An accurate measurement, on 
the other hand, would be one free from all kinds of errors — mistakes, 
systematic errors, and accidental errors. Barring mistakes, the systematic 
error is thus the difference between the precise value and the accurate or 
true value of the quantity measured. If the systematic error should 
happen to be large, a precise measurement might be very inaccurate. The 
accuracy of a measurement can be increased by using more refined instru- 
ments and methods, whereas the precision can be increased only by using 
more care in making the measurement. 




I. DIRECT MEASUREMENTS. 

155. Measures of Precision. The precision of a measurement can be 
estimated in several ways. The three measures of precision in common 
use are the following: the mean square error (m.s.e.), the probable error 
(P.E.), and the average error. These three measures are denoted by the
letters $\mu$, r, and $\eta$, respectively. We shall now derive expressions for them
in terms of the precision index h.

(a). The Mean Square Error (M.S.E.). In discussing the error equation

$y = \frac{h}{\sqrt{\pi}}\, e^{-h^2 x^2}$

in Art. 143, we stated that h is called the index of precision and indicated
the reason for this name. Then in Art. 147 we found that the probability
of the simultaneous occurrence of a set of errors $x_1, x_2, \cdots x_n$ in a given
measurement is

(1)  $P = p_1 p_2 \cdots p_n = \frac{h^n}{(\sqrt{\pi})^n}\, e^{-h^2 (x_1^2 + x_2^2 + \cdots + x_n^2)}\, dx_1\, dx_2 \cdots dx_n .$

It was also shown in that article that the best or most probable result 
obtainable from a set of measurements is that corresponding to the 
maximum value of P. 

Let us now assume that a given set of n measurements has been made
and let us try to find the best or most probable value of the precision
index h for this set of measurements. It is that value which makes P a
maximum and is found by differentiating P with respect to h and putting
the derivative equal to zero. We thus get from (1)

$\frac{h^n}{(\sqrt{\pi})^n}\, e^{-h^2 (x_1^2 + x_2^2 + \cdots + x_n^2)} \left[-2h (x_1^2 + x_2^2 + \cdots + x_n^2)\right] + \frac{n h^{n-1}}{(\sqrt{\pi})^n}\, e^{-h^2 (x_1^2 + x_2^2 + \cdots + x_n^2)} = 0 ,$

or

$\frac{h^{n-1}}{(\sqrt{\pi})^n}\, e^{-h^2 (x_1^2 + x_2^2 + \cdots + x_n^2)} \left[-2h^2 (x_1^2 + x_2^2 + \cdots + x_n^2) + n\right] = 0 .$

$\therefore \; -2h^2 (x_1^2 + x_2^2 + \cdots + x_n^2) + n = 0 ,$

or

$\frac{1}{2h^2} = \frac{x_1^2 + x_2^2 + \cdots + x_n^2}{n} ,$

from which

$\frac{1}{h\sqrt{2}} = \sqrt{\frac{x_1^2 + x_2^2 + \cdots + x_n^2}{n}} .$

The quantity on the right is usually called the mean square error
(M.S.E.) of a single observation and is denoted by the Greek letter $\mu$.
We therefore have

(2)  $\mu = \frac{1}{h\sqrt{2}} = \sqrt{\frac{x_1^2 + x_2^2 + \cdots + x_n^2}{n}} .$

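(b). The Probable Error (P.E.). The probable error, r, of a single
measurement of a series is a quantity such that one half the errors of
the series are greater than it and the other half less than it. In other
words, the probability that the error of a single measurement will fall
between r and $-r$ is 1/2, and the probability that it will fall outside these
limits is 1/2. Hence we must have

$\frac{h}{\sqrt{\pi}} \int_{-r}^{r} e^{-h^2 x^2}\, dx = \frac{1}{2}$, or $\frac{h}{\sqrt{\pi}} \int_{0}^{r} e^{-h^2 x^2}\, dx = \frac{1}{4}$,

since the probability that an error lies between any given limits is
represented by the area under the probability curve between those limits.

To find the value of r from the above equation we put

$t = hx .$

Then

$dt = h\, dx ,$

and we have

$\frac{1}{\sqrt{\pi}} \int_{0}^{hr} e^{-t^2}\, dt = \frac{1}{4}$, or $\int_{0}^{\rho} e^{-t^2}\, dt = \frac{\sqrt{\pi}}{4} = 0.4431135$, where $\rho = hr$.

Now

$e^{-t^2} = 1 - t^2 + \frac{t^4}{2} - \frac{t^6}{6} + \frac{t^8}{24} - \frac{t^{10}}{120} + \cdots .$

$\therefore \; \int_{0}^{\rho} e^{-t^2}\, dt = \rho - \frac{\rho^3}{3} + \frac{\rho^5}{10} - \frac{\rho^7}{42} + \frac{\rho^9}{216} - \frac{\rho^{11}}{1320} + \cdots = 0.4431135 ,$

or

(3)  $\rho - \frac{\rho^3}{3} + \frac{\rho^5}{10} - \frac{\rho^7}{42} + \frac{\rho^9}{216} - \frac{\rho^{11}}{1320} + \cdots - 0.4431135 = 0 .$

This is the equation which we have already solved in Art. 75 and found
$\rho = 0.4769363$. The value of $\rho$ can also be found by interpolation, as we
have already done in two ways in Ex. 2, Art. 27 and Ex. 1, Art. 34.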
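As an independent check on the value of $\rho$, the condition that the integral of $e^{-t^2}$ from 0 to $\rho$ equal $\sqrt{\pi}/4$ can be solved by simple bisection. This is only a sketch and is not the method of Art. 75: it assumes the standard-library error function `math.erf`, and the helper names are our own. The power series is also evaluated at the root as a cross-check.

```python
import math

def integral_0_to(rho):
    """Integral of exp(-t^2) from 0 to rho, expressed through erf."""
    return 0.5 * math.sqrt(math.pi) * math.erf(rho)

target = math.sqrt(math.pi) / 4          # right-hand member, about 0.4431135

# Bisection on [0, 1]; the integral increases steadily with rho.
lo, hi = 0.0, 1.0
for _ in range(60):
    mid = 0.5 * (lo + hi)
    if integral_0_to(mid) < target:
        lo = mid
    else:
        hi = mid
rho = 0.5 * (lo + hi)

def series(p, terms=12):
    """Term-by-term integral of the series for exp(-t^2):
    p - p^3/3 + p^5/10 - p^7/42 + p^9/216 - p^11/1320 + ..."""
    return sum((-1) ** k * p ** (2 * k + 1) / (math.factorial(k) * (2 * k + 1))
               for k in range(terms))

print(rho, series(rho), target)
```

The series converges rapidly for arguments of this size, so a dozen terms reproduce the target to well beyond seven decimals.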

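Using now the relation $\rho = hr$, we get

$r = \frac{\rho}{h} = \frac{0.4769}{h} ,$

and from (2) we have

$\frac{1}{h} = \sqrt{2}\, \sqrt{\frac{x_1^2 + x_2^2 + \cdots + x_n^2}{n}} .$

Hence

$r = 0.4769 \sqrt{2}\, \sqrt{\frac{x_1^2 + x_2^2 + \cdots + x_n^2}{n}} = 0.6745 \sqrt{\frac{x_1^2 + x_2^2 + \cdots + x_n^2}{n}} ,$

or

(4)  $r = 0.6745 \sqrt{\frac{x_1^2 + x_2^2 + \cdots + x_n^2}{n}} .$

(c). The Average Error. The average error is the arithmetic mean of all
the errors of a set, without regard to signs. Thus,

(5)  $\eta = \frac{|x_1| + |x_2| + \cdots + |x_n|}{n} .$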


To find an expression for $\eta$ in terms of h let us suppose that a set of n
measurements has been made, and that each measurement is affected
with an error of some size. In the case of any single measurement the
probability of an error of magnitude x to $x + \Delta x$ is approximately
$y\,\Delta x = (h/\sqrt{\pi})\, e^{-h^2 x^2}\, \Delta x$ (Art. 14^). Hence the probable number of errors
of this size in the n measurements is n times this probability, or
$(nh/\sqrt{\pi})\, e^{-h^2 x^2}\, \Delta x$. The sum of these errors is therefore the number of
errors times the size of a single error, or $(nhx/\sqrt{\pi})\, e^{-h^2 x^2}\, \Delta x$. The sum of
all the errors of all sizes, taken without regard to sign, is therefore

$S = \frac{2nh}{\sqrt{\pi}} \int_{0}^{\infty} x\, e^{-h^2 x^2}\, dx = -\frac{n}{h\sqrt{\pi}} \int_{0}^{\infty} e^{-h^2 x^2}\, (-2h^2 x\, dx) = -\frac{n}{h\sqrt{\pi}} \Big[ e^{-h^2 x^2} \Big]_{0}^{\infty} ,$

or

$S = \frac{n}{h\sqrt{\pi}} .$

Hence

(6)  $\eta = \frac{S}{n} = \frac{1}{h\sqrt{\pi}} .$


156. Relations between the Precision Measures. From (2) and (4)
of Art. 155 we have

(1)  $r = 0.6745\,\mu = \tfrac{2}{3}\,\mu$, roughly,

and

(2)  $\mu = \frac{r}{0.6745} = 1.4826\, r .$

Also, since

$\eta = \frac{1}{h\sqrt{\pi}}$ and $\mu = \frac{1}{h\sqrt{2}}$,

we have

(3)  $\eta = \frac{1}{h\sqrt{\pi}} = \mu \sqrt{\frac{2}{\pi}} = 0.79788\,\mu$
     $= 0.8\,\mu$, approximately.

Hence

(4)  $\mu = \frac{\eta}{0.79788} = 1.2533\,\eta .$

Furthermore, from (2) and (4) we get

$1.4826\, r = 1.2533\,\eta .$

(5)  $\therefore \; r = \frac{1.2533}{1.4826}\,\eta = 0.8453\,\eta ,$

and

(6)  $\eta = \frac{1.4826}{1.2533}\, r = 1.1829\, r .$

All these relations are shown concisely in the following table:

              $\mu$         r          $\eta$

$\mu$ =      1.0000      1.4826      1.2533
r =          0.6745      1.0000      0.8453
$\eta$ =     0.7979      1.1829      1.0000
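The numerical factors connecting $\mu$, r, and $\eta$ can be recomputed directly. The sketch below assumes Python's standard-library `statistics.NormalDist` and treats $\mu$ as the standard deviation of the normal curve; the probable error is then the deviation that half the errors fall within, i.e. the 75 per cent point of the distribution. The dictionary and variable names are ours.

```python
from math import pi, sqrt
from statistics import NormalDist

# Ratios of each measure to the mean square error mu.
r_over_mu = NormalDist().inv_cdf(0.75)   # probable error: half the errors lie within +/- r
eta_over_mu = sqrt(2 / pi)               # average error: eta = mu * sqrt(2/pi)

ratios = {"mu": 1.0, "r": r_over_mu, "eta": eta_over_mu}

# table[row][col] = k such that  row = k * col
table = {row: {col: ratios[row] / ratios[col] for col in ratios} for row in ratios}

for row, cols in table.items():
    print(row, "=", "  ".join(f"{k:.4f} {col}" for col, k in cols.items()))
```

Each printed row expresses one measure as a multiple of each of the three, in the same arrangement as the table of relations.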


157. Geometric Significance of $\mu$, r, and $\eta$. From the definition of r
it follows that its corresponding ordinate to the probability curve bisects
the area under that curve on either side of the y-axis.

The quantity $\mu$ is the abscissa of the point of inflection of the
probability curve, as we shall now show.

Taking the second derivative of

$y = \frac{h}{\sqrt{\pi}}\, e^{-h^2 x^2}$

and equating it to zero, we have

$\frac{dy}{dx} = -\frac{2h^3}{\sqrt{\pi}}\, x\, e^{-h^2 x^2} ,$

$\frac{d^2 y}{dx^2} = -\frac{2h^3}{\sqrt{\pi}}\, e^{-h^2 x^2} (1 - 2h^2 x^2) = 0 .$

Hence

$1 - 2h^2 x^2 = 0 ,$

or

$x = \frac{1}{h\sqrt{2}} = \mu .$

The precision measure $\eta$ is the abscissa of the center of gravity of the
area (under the curve) on either side of the y-axis. To prove this we
recall that if $x_0$ denote the abscissa of the center of gravity of that area
we have

$x_0 = \frac{\int_{0}^{\infty} x\, y\, dx}{\text{area}} = \frac{\int_{0}^{\infty} x\, \frac{h}{\sqrt{\pi}}\, e^{-h^2 x^2}\, dx}{1/2} = \frac{2h}{\sqrt{\pi}} \cdot \frac{1}{2h^2} = \frac{1}{h\sqrt{\pi}} = \eta .$

[Fig. 45]




The relative sizes of the precision measures and their geometric relations 
are shown in Fig. 45. 

The question naturally arises as to which precision measure is the 
best for practical use. On this point there is no universal agreement. 
In continental Europe the m.s.e. is used almost exclusively, but in 
England and America the p.e. is more often used. The average error 
is also used in America, but usually under the name average deviation. 

The M.S.E. is used almost exclusively in Mathematical Statistics, where
it is called the standard deviation and denoted by $\sigma$.

The average error is the easiest of all to compute, and the p.e. is the 
most laborious, because of the factor 0.6745. Nevertheless, in this book 
we shall conform to American practice and use the p.e. almost exclusively. 


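158. Relation between Probable Error and Weight, and the Probable
Error of the Arithmetic and Weighted Means. In Art. 148 we derived
the relation between the precision index h and the weight w of an
observation, namely:

(158.1)  $\frac{h_1^2}{w_1} = \frac{h_2^2}{w_2} = \frac{h_3^2}{w_3} = \cdots = \frac{h_n^2}{w_n} .$

Then in Art. 155 we found the relation

$r = \frac{\rho}{h}$, where $\rho = 0.4769$.

Hence

$h = \frac{\rho}{r} .$

Let $w_1, w_2, \cdots w_n$ be the weights of observations whose probable errors
are $r_1, r_2, \cdots r_n$, respectively. Then

$h_1 = \frac{\rho}{r_1}, \quad h_2 = \frac{\rho}{r_2}, \quad \cdots \quad h_n = \frac{\rho}{r_n} .$

Substituting these values for $h_1, h_2, \cdots h_n$ in (158.1), we get

$\frac{\rho^2}{r_1^2 w_1} = \frac{\rho^2}{r_2^2 w_2} = \cdots = \frac{\rho^2}{r_n^2 w_n} ,$

or

$\frac{1}{r_1^2 w_1} = \frac{1}{r_2^2 w_2} = \cdots = \frac{1}{r_n^2 w_n} .$

Hence

(158.2)  $\frac{w_1}{w_2} = \frac{r_2^2}{r_1^2}$, etc.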




The weights are thus inversely proportional to the squares of the probable 
errors. 

This relation (158.2) enables us to find the p.e. of the arithmetic and 
weighted means of a set of n direct measurements. 

To find the p.e. of the arithmetic mean of n direct measurements of
equal weight, let the weight of each measurement be 1. Then the weight
of the mean of all the measurements will be n. Denoting by r the p.e.
of any single measurement and by $r_0$ the p.e. of the mean of all the
measurements, we have from (158.2)

$\frac{1}{n} = \frac{r_0^2}{r^2}$, or $r_0^2 = \frac{r^2}{n}$.

Hence the p.e. of the mean is

(158.3)  $r_0 = \frac{r}{\sqrt{n}} .$

If the measurements are not all of equal weight, let $w_1, w_2, \cdots w_n$
denote their weights. Then if r denote the p.e. of a measurement of
unit weight (w = 1) and $r_i$ the p.e. of a measurement of weight $w_i$, we
have from (158.2)

$\frac{1}{w_i} = \frac{r_i^2}{r^2}$, or $r_i^2 = \frac{r^2}{w_i}$.

Hence

(158.4)  $r_i = \frac{r}{\sqrt{w_i}} .$

Now the weight of the weighted mean is $\sum w = w_1 + w_2 + \cdots + w_n$.
Hence by (158.4) the p.e. of this mean is

(158.5)  $r_0 = \frac{r}{\sqrt{\sum w}} = \frac{r}{\sqrt{w_1 + w_2 + \cdots + w_n}} .$

Formula (158.3) shows that the p.e. of the arithmetic mean can be 
decreased by increasing the number of measurements. A glance at the 
graph of this equation shows, however, (see Fig. 46) that the decrease 
is very slight after several measurements have been made. Usually it 
does not pay to make more than ten measurements for the purpose of 
reducing the p.e. of the arithmetic mean. 
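The diminishing return from additional measurements is easily seen by tabulating the p.e. of the mean, which varies inversely as the square root of the number of measurements. In this sketch the p.e. of a single measurement is taken as 1 purely for illustration, and the function name `pe_of_mean` is our own.

```python
from math import sqrt

def pe_of_mean(r_single, n):
    """p.e. of the arithmetic mean of n equally weighted measurements."""
    return r_single / sqrt(n)

r_single = 1.0   # hypothetical p.e. of one measurement, in arbitrary units

for n in (1, 2, 5, 10, 20, 50, 100):
    print(f"n = {n:3d}   r0 = {pe_of_mean(r_single, n):.3f}")

# Gain from the first ten measurements versus the gain from ten more.
gain_first_ten = pe_of_mean(r_single, 1) - pe_of_mean(r_single, 10)
gain_next_ten = pe_of_mean(r_single, 10) - pe_of_mean(r_single, 20)
print(gain_first_ten, gain_next_ten)
```

The first ten measurements remove about two thirds of the p.e. of a single measurement, while the next ten remove less than a tenth of it.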

159. Computation of the Precision Measures from the Residuals. So
far in our discussion of precision we have been considering the errors of
measurements. Since the true errors can not be found, it is necessary
to derive formulas for the precision measures in terms of the residuals.

In Art. 151 it was shown that when the errors of a set of measurements
follow the Normal Law of error, the residuals likewise follow a similar
law. The probability equation for any residual will therefore be of the
form

(1)  $y = \frac{H}{\sqrt{\pi}}\, e^{-H^2 v^2}$

for measurements of equal weight, where $H = h\sqrt{n/(n-1)}$. (See Art.
151.) For a set of n direct measurements of equal weight we therefore
have for the n residuals the following probabilities:

$p_1 = \frac{H}{\sqrt{\pi}}\, e^{-H^2 v_1^2}\, dv_1, \quad p_2 = \frac{H}{\sqrt{\pi}}\, e^{-H^2 v_2^2}\, dv_2, \quad \cdots \quad p_n = \frac{H}{\sqrt{\pi}}\, e^{-H^2 v_n^2}\, dv_n .$

The chance that this particular set of residuals will be made in any set
of measurements is then

(2)  $P = p_1 p_2 \cdots p_n = \frac{H^n}{(\sqrt{\pi})^n}\, e^{-H^2 (v_1^2 + v_2^2 + \cdots + v_n^2)}\, dv_1\, dv_2 \cdots dv_n .$

Differentiating (2) with respect to H and putting the derivative equal
to zero, exactly as was done in Art. 155, we get

$\frac{1}{H\sqrt{2}} = \sqrt{\frac{v_1^2 + v_2^2 + \cdots + v_n^2}{n}} .$

But

$H = h\sqrt{\frac{n}{n-1}}$ and $\frac{1}{h\sqrt{2}} = \mu$.

Hence

$\frac{1}{H\sqrt{2}} = \frac{1}{h\sqrt{2}} \sqrt{\frac{n-1}{n}} = \mu \sqrt{\frac{n-1}{n}} = \sqrt{\frac{v_1^2 + v_2^2 + \cdots + v_n^2}{n}} ,$

$\therefore \; \mu = \sqrt{\frac{n}{n-1}}\, \sqrt{\frac{v_1^2 + v_2^2 + \cdots + v_n^2}{n}} = \sqrt{\frac{v_1^2 + v_2^2 + \cdots + v_n^2}{n-1}} = \sqrt{\frac{\sum v^2}{n-1}} .$

Therefore

(159.1)  $r = 0.6745\,\mu = 0.6745 \sqrt{\frac{\sum v^2}{n-1}} .$

For the p.e. of the arithmetic mean we have

(159.2)  $r_0 = \frac{r}{\sqrt{n}} = 0.6745 \sqrt{\frac{\sum v^2}{n(n-1)}} .$

If the measurements are not all of equal weight, the residuals will not
have the same weight. They can all be reduced to unit weight, however,
by multiplying each of them by the square root of its weight. This
follows from (158.4), since $r_i \sqrt{w_i} = r$.

Let $v_1, v_2, \cdots v_n$ be the residuals of n measurements of weights
$w_1, w_2, \cdots w_n$. Then the residuals reduced to unit weight are

$v_1' = v_1 \sqrt{w_1}$,
$v_2' = v_2 \sqrt{w_2}$,
$\cdots$
$v_n' = v_n \sqrt{w_n}$.

Squaring these equations and adding, we get

(6)  $\sum v'^2 = w_1 v_1^2 + w_2 v_2^2 + \cdots + w_n v_n^2 = \sum w v^2 .$

Now for a set of measurements of equal weight we have from (159.1)

$r = 0.6745 \sqrt{\frac{\sum v'^2}{n-1}} .$

Replacing $\sum v'^2$ by its equal from (6), we get

(159.3)  $r = 0.6745 \sqrt{\frac{\sum w v^2}{n-1}} = 0.6745 \sqrt{\frac{w_1 v_1^2 + w_2 v_2^2 + \cdots + w_n v_n^2}{n-1}} .$

This is the p.e. of a single measurement of unit weight.

To find the p.e. of a measurement of weight $w_i$ and the p.e. of the
weighted mean we have from (158.4), (158.5), and (159.3)

(159.4)  $r_i = \frac{r}{\sqrt{w_i}} = 0.6745 \sqrt{\frac{\sum w v^2}{(n-1)\, w_i}} ,$

(159.5)  $r_0 = \frac{r}{\sqrt{\sum w}} = 0.6745 \sqrt{\frac{\sum w v^2}{(n-1) \sum w}} .$

It will be observed that (159.5) reduces to (159.2) when all the weights
are equal.

We now collect for easy reference the fundamental formulas for com- 
puting the p.e. of direct measurements. 

(a) Measurements of equal precision. P.E. of a single measurement:

(159.1)  $r = 0.6745 \sqrt{\frac{v_1^2 + v_2^2 + \cdots + v_n^2}{n-1}} .$

P.E. of arithmetic mean:

(159.2)  $r_0 = 0.6745 \sqrt{\frac{v_1^2 + v_2^2 + \cdots + v_n^2}{n(n-1)}} .$

(b) Weighted measurements. P.E. of a single measurement of unit
weight:

(159.3)  $r = 0.6745 \sqrt{\frac{w_1 v_1^2 + w_2 v_2^2 + \cdots + w_n v_n^2}{n-1}} .$

P.E. of measurement of weight $w_i$:

(159.4)  $r_i = 0.6745 \sqrt{\frac{w_1 v_1^2 + w_2 v_2^2 + \cdots + w_n v_n^2}{(n-1)\, w_i}} .$

P.E. of weighted mean:

(159.5)  $r_0 = 0.6745 \sqrt{\frac{w_1 v_1^2 + w_2 v_2^2 + \cdots + w_n v_n^2}{(n-1)(w_1 + w_2 + \cdots + w_n)}} .$
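The formulas for weighted measurements lend themselves to a short routine. What follows is a sketch under stated assumptions: the readings and weights are hypothetical, and the function names `weighted_precision` and `pe_of_weight` are our own. The final lines confirm that equal weights lead back to the p.e. of the arithmetic mean, formula (159.2).

```python
from math import fsum, sqrt

K = 0.6745  # factor connecting the p.e. with the mean square error

def weighted_precision(measurements, weights):
    """Return (m0, r, r0): the weighted mean (150.1), the p.e. of a
    measurement of unit weight (159.3), and the p.e. of the weighted
    mean (159.5)."""
    n = len(measurements)
    total_w = fsum(weights)
    m0 = fsum(w * m for m, w in zip(measurements, weights)) / total_w
    # Sum of weighted squares of the residuals v_i = m0 - m_i.
    sum_wv2 = fsum(w * (m0 - m) ** 2 for m, w in zip(measurements, weights))
    r = K * sqrt(sum_wv2 / (n - 1))
    r0 = r / sqrt(total_w)
    return m0, r, r0

def pe_of_weight(r, w_i):
    """p.e. of a single measurement of weight w_i, formula (159.4)."""
    return r / sqrt(w_i)

# Hypothetical readings and weights, for illustration only.
readings = [10.2, 10.4, 10.1, 10.3]
m0, r, r0 = weighted_precision(readings, [1, 3, 2, 4])
print(m0, r, r0, pe_of_weight(r, 4))

# With all weights equal, (159.5) must agree with (159.2).
n = len(readings)
_, _, r0_equal = weighted_precision(readings, [2] * n)
mean = fsum(readings) / n
r0_159_2 = K * sqrt(fsum((mean - m) ** 2 for m in readings) / (n * (n - 1)))
print(r0_equal, r0_159_2)
```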


160. The Combination of Sets of Measurements when the P.E.'s of
the Sets are Given. When several separate determinations of the magnitude
of a quantity have been made by different observers or by different
methods and the probable errors of the separate determinations are given,
it is important to know just how to combine these several results so as
to obtain from them the best value for the measured quantity and the
probable error of this best value. For example, the results of five different
determinations of the atomic weight of silver are given below. How can
we obtain from them the best value for the atomic weight and how can
we find the p.e. of this value?

107.9401 ± 0.0058
107.9406 ± 0.0049
107.9233 ± 0.0140
107.9371 ± 0.0046
107.9270 ± 0.0090

This is really a problem in indirect measurements, but it can readily
be solved by the methods already given. The proper method of procedure
in a problem of this type is first to compute by the relation (158.2) the
weights of the several determinations from their given probable errors
and then find the weighted mean of the given values of the measured
quantity. The p.e. of this weighted mean is to be computed by formula
(158.5). See Example 4.
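The procedure just described may be sketched for the five determinations of the atomic weight of silver. One assumption is made for convenience and is not taken from the text: the measurement of unit weight is chosen to have a p.e. of 1, so that each weight is simply the reciprocal of the square of the given p.e. Any other choice of unit multiplies all the weights by the same factor and leaves both the weighted mean and its p.e. unchanged.

```python
from math import fsum, sqrt

# (value, probable error) for the five determinations quoted above.
determinations = [
    (107.9401, 0.0058),
    (107.9406, 0.0049),
    (107.9233, 0.0140),
    (107.9371, 0.0046),
    (107.9270, 0.0090),
]

# Weights inversely proportional to the squares of the probable errors.
# Assumption: the unit-weight measurement has p.e. r_unit = 1.
r_unit = 1.0
weights = [(r_unit / pe) ** 2 for _, pe in determinations]

total_w = fsum(weights)
best_value = fsum(w * x for (x, _), w in zip(determinations, weights)) / total_w

# p.e. of the weighted mean: r_unit divided by the square root of the total weight.
pe_best = r_unit / sqrt(total_w)

print(f"{best_value:.4f} +/- {pe_best:.4f}")
```

The p.e. of the result is smaller than that of the most precise single determination, as it should be when independent determinations are combined.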

Note. Some authors compute the p.e. of the weighted mean in this
case by finding the residuals of the sets of measurements and then finding
the p.e. of the weighted mean by formula (159.5). Such a method is
incorrect, and the p.e. of the weighted mean found by this procedure
may be worthless. For an investigation of this matter from several angles,
see the following papers:

1. "The Invalidity of a Commonly Used Method for Computing a
Certain Probable Error." Proc. Nat. Acad. Sci., Vol. 15, No. 8 (August,
1929), pp. 665-668.

2. "On the Computation of the Probable Error of a Weighted Mean."
Am. Math. Monthly, Vol. XLII, No. 5 (May, 1935), pp. 286-301.

We shall now show the use of the formulas derived in the preceding
sections. 

Example 1. The following measurements were made to determine the 
length of a base line in a geodetic survey. Find the most probable 
length of the line, the p.e. of a single measurement, and the p.e. of the 
arithmetic mean. 

Solution. The measurements, residuals, etc. are arranged in tabular
form as shown below. The first step in the solution is to find the
arithmetic mean of the given measurements. Then the residuals are found
by subtracting each measurement from the arithmetic mean.





    M_1  = 455.35      v_1  = -0.02      v_1^2  = 0.0004
    M_2  = 455.35      v_2  = -0.02      v_2^2  = 0.0004
    M_3  = 455.20      v_3  = +0.13      v_3^2  = 0.0169
    M_4  = 455.05      v_4  = +0.28      v_4^2  = 0.0784
    M_5  = 455.75      v_5  = -0.42      v_5^2  = 0.1764
    M_6  = 455.40      v_6  = -0.07      v_6^2  = 0.0049
    M_7  = 455.10      v_7  = +0.23      v_7^2  = 0.0529
    M_8  = 455.30      v_8  = +0.03      v_8^2  = 0.0009
    M_9  = 455.50      v_9  = -0.17      v_9^2  = 0.0289
    M_10 = 455.30      v_10 = +0.03      v_10^2 = 0.0009

    ΣM = 10 × 455 + 3.30      Σv = 0      Σv^2 = 0.3610

$$ M_0 = \frac{10 \times 455 + 3.30}{10} = 455.330. $$

$$ r = 0.6745\sqrt{\frac{0.3610}{9}} = 0.135, \quad \text{by (159.1).} $$

$$ r_0 = \frac{0.135}{\sqrt{10}} = 0.043, \quad \text{by (159.2).} $$

The length of the line is therefore to be written

M = 455.330 ± 0.043.
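The arithmetic of Example 1 can be checked mechanically. The following is a minimal sketch, not part of the original text: the function name `probable_errors` is hypothetical, and the residuals are formed from the unrounded mean, so the last figure may differ slightly from a hand computation.

```python
import math

PE_FACTOR = 0.6745  # ratio of probable error to standard deviation (normal law)

def probable_errors(measurements):
    """Return (mean, p.e. of a single measurement, p.e. of the mean)."""
    n = len(measurements)
    mean = sum(measurements) / n
    residuals = [mean - m for m in measurements]          # v = M0 - M
    sum_v2 = sum(v * v for v in residuals)
    r_single = PE_FACTOR * math.sqrt(sum_v2 / (n - 1))    # form used with (159.1)
    r_mean = r_single / math.sqrt(n)                      # form used with (159.2)
    return mean, r_single, r_mean

# Base-line measurements of Example 1.
base_line = [455.35, 455.35, 455.20, 455.05, 455.75,
             455.40, 455.10, 455.30, 455.50, 455.30]
mean, r_single, r_mean = probable_errors(base_line)
print(f"M0 = {mean:.3f}, r = {r_single:.3f}, r0 = {r_mean:.3f}")
```

The three printed quantities correspond to the most probable length, the p.e. of a single measurement, and the p.e. of the arithmetic mean.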


Note. The number of significant figures to be recorded in the most 
probable value (arithmetic or general mean) is usually one more than 
the number given in the individual measurements (Art. 7). If the p.e. 
of the final result should be relatively large, however, we are not justified 
in recording this result to more figures than are contained in the separate 
measurements, and in such cases we record the final result to the same 
number of figures as given in the data. 

The P.E. of the result is recorded to only one or two significant figures — 
just enough to extend to the last figure of the mean. Slide-rule accuracy 
is therefore amply sufficient in the computation of probable errors. 

In finding the residuals we use only as many figures in the mean as are 
given in the individual measurements. 

Sometimes too much importance is attached to the probable error of a
mean and too little to the mean itself. The mean is the important thing
in any set of measurements or in any combination of sets; the probable
error is of secondary importance. Theoretically, the weights of the separate
means should be used in computing the general mean, but in the
combination of only a few sets of measurements (from two to five, for
example) such a procedure is of doubtful value. When only a few sets
are to be combined, the simple average is usually better than the weighted
average and is much easier to compute.

Example 2. The following measurements were made to determine a
certain wave length. Find the most probable wave length and its p.e. 

Solution. Here we first find the mean and then the residuals as before. 
The rounded mean is correct to its last figure as given, but since the last 
digit is slightly less than 5 the mean when rounded to three decimals is 
4.505. From this number we subtract the individual measurements to 
find the residuals. 


     n       M          v          v^2
     1     4.524     -0.019     0.000361
     2     4.500     +0.005     0.000025
     3     4.515     -0.010     0.000100
     4     4.508     -0.003     0.000009
     5     4.513     -0.008     0.000064
     6     4.511     -0.006     0.000036
     7     4.497     +0.008     0.000064
     8     4.507     -0.002     0.000004
     9     4.501     +0.004     0.000016
    10     4.502     +0.003     0.000009
    11     4.485     +0.020     0.000400
    12     4.519     -0.014     0.000196
    13     4.517     -0.012     0.000144
    14     4.504     +0.001     0.000001
    15     4.493     +0.012     0.000144
    16     4.492     +0.013     0.000169
    17     4.505      0.000     0.000000

    M_0 = 4.5055     Σv = -0.008     Σv^2 = 0.001742

$$ r_0 = 0.6745\sqrt{\frac{0.001742}{17 \times 16}} = 0.0017, \quad \text{by (159.2).} $$

∴ M = 4.5055 ± 0.0017.


Remark. Theoretically the algebraic sum of the residuals should be
zero, but since these residuals in any actual problem are necessarily rounded
numbers their algebraic sum is rarely zero. However, if the algebraic
sum of the residuals is not practically zero, there is a numerical mistake
either in the mean or in some residual. Hence it is very important to
check the computation by noting whether Σv is practically zero.

Example 3. Six measurements of the parallax of a star are given in
the following table. Find the most probable value of the parallax and
its p.e.


      M        w       wM         v          v^2          wv^2
    0".507     8     4.056     -0.104     0.010816     0.086528
    0".438     5     2.190     -0.035     0.001225     0.006125
    0".381     2     0.762     +0.022     0.000484     0.000968
    0".371     8     2.968     +0.032     0.001024     0.008192
    0".350    13     4.550     +0.053     0.002809     0.036517
    0".402    20     8.040     +0.001     0.000001     0.000020

    Σw = 56      ΣwM = 22.566      Σwv^2 = 0.13835

$$ M_0 = \frac{22.566}{56} = 0''.403. $$

$$ r_0 = 0.6745\sqrt{\frac{0.13835}{5 \times 56}} = 0''.015. $$

Hence the final result is

M = 0".403 ± 0".015.


Here the p.e. of the weighted mean is so large (relatively) that we are 
not justified in recording the result to more figures than are given in 
the data. 
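A short sketch of the computation in Example 3 follows; it is an illustration only. The function name `weighted_mean_pe` is hypothetical, and the first three parallaxes are entered as wM/w (4.056/8, 2.190/5, 0.762/2), which is an assumption about the tabulated values. The p.e. is formed from the weighted residuals, as in the example.

```python
import math

def weighted_mean_pe(values, weights):
    """Weighted mean and its p.e. computed from the weighted residuals."""
    n = len(values)
    sum_w = sum(weights)
    mean = sum(w * m for w, m in zip(weights, values)) / sum_w
    # Sum of w * v^2 with v = M0 - M (unrounded mean used here).
    sum_wv2 = sum(w * (mean - m) ** 2 for w, m in zip(weights, values))
    r0 = 0.6745 * math.sqrt(sum_wv2 / ((n - 1) * sum_w))
    return mean, r0

parallax = [0.507, 0.438, 0.381, 0.371, 0.350, 0.402]   # seconds of arc
weights = [8, 5, 2, 8, 13, 20]
m0, r0 = weighted_mean_pe(parallax, weights)
print(f"M0 = {m0:.3f}, r0 = {r0:.3f}")
```

This form applies when the weights are given directly with the measurements, as in the parallax table.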


Example 4. Seven separate determinations of the difference of longitude
between two places gave the following results. Find the most
probable value of the longitude difference and its p.e.

    1     19^m 1^s.42 ± 0^s.044
    2     19   1  .37 ± 0  .037
    3     19   1  .38 ± 0  .036
    4     19   1  .45 ± 0  .036
    5     19   1  .60 ± 0  .046
    6     19   1  .55 ± 0  .045
    7     19   1  .57 ± 0  .047

Solution. The first step in the solution of this problem is to find the
weights of the different determinations from their given probable errors.
From Art. 158 we have

$$ w_1 : w_2 : \cdots : w_7 = \frac{1}{r_1^2} : \frac{1}{r_2^2} : \cdots : \frac{1}{r_7^2}. $$

Hence

$$ r_1^2 w_1 = r_2^2 w_2 = \cdots = r_7^2 w_7 = c. $$

Let us take the weight of the last determination as unity, that is, let
us put

$$ w_7 = 1. $$

Then

$$ c = r_7^2 = (0.047)^2. $$

Hence

$$ w_1 = \frac{c}{r_1^2} = \left(\frac{0.047}{0.044}\right)^2 = \left(\frac{47}{44}\right)^2 = 1.14, $$

$$ w_2 = \frac{c}{r_2^2} = \left(\frac{0.047}{0.037}\right)^2 = \left(\frac{47}{37}\right)^2 = 1.61. $$

In like manner we find

$$ w_3 = 1.70, \quad w_4 = 1.70, \quad w_5 = 1.04, \quad w_6 = 1.09. $$

To save labor in the computation of the weighted mean let us denote
by $d_1, d_2, \cdots d_7$ the differences between the various determinations and
an assumed approximate value of the weighted mean, say 19^m 1^s.40. Then
the various determinations are 19^m 1^s.40 + d_1, 19^m 1^s.40 + d_2, etc.; and
their weighted mean is

$$ M_0 = \frac{w_1(19^{m}1^{s}.40 + d_1) + w_2(19^{m}1^{s}.40 + d_2) + \cdots + w_7(19^{m}1^{s}.40 + d_7)}{w_1 + w_2 + \cdots + w_7} $$

$$ = \frac{(w_1 + w_2 + \cdots + w_7)(19^{m}1^{s}.40) + w_1d_1 + w_2d_2 + \cdots + w_7d_7}{w_1 + w_2 + \cdots + w_7} $$

$$ = 19^{m}1^{s}.40 + \frac{w_1d_1 + w_2d_2 + \cdots + w_7d_7}{w_1 + w_2 + \cdots + w_7}. $$

This equation shows that it is necessary to multiply only the d's by
the weights. We therefore complete the solution by making out the table
shown below and then using (158.6).




$$ M_0 = 19^{m}1^{s}.40 + \frac{\sum wd}{\sum w} = 19^{m}1^{s}.40 + 0^{s}.061 = 19^{m}1^{s}.461. $$

Then since the weight of M_7 is assumed to be 1, we substitute the value
of r_7 in the formula (158.6) and get

$$ r_0 = \frac{0.047}{\sqrt{9.28}} = 0^{s}.015. $$

∴ M = 19^m 1^s.461 ± 0^s.015.
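The procedure of Example 4 (weights from the given p.e.'s, an assumed approximate mean, and the p.e. of the result from the determination of unit weight) can be sketched as follows. The function name `combine_with_pes` and its `unit_index` parameter are hypothetical; the weights are kept unrounded, so the sums differ slightly from the two-decimal weights used by hand.

```python
import math

def combine_with_pes(values, pes, unit_index=-1):
    """Weights, weighted mean, and p.e. of the mean from given p.e.'s.

    The determination at unit_index is given weight 1, so c = r_unit^2
    and w_i = c / r_i^2.
    """
    r_unit = pes[unit_index]
    weights = [(r_unit / r) ** 2 for r in pes]
    sum_w = sum(weights)
    mean = sum(w * v for w, v in zip(weights, values)) / sum_w
    r0 = r_unit / math.sqrt(sum_w)     # p.e. of the weighted mean
    return weights, mean, r0

# Seconds part of the seven longitude differences (the 19 min is common).
seconds = [1.42, 1.37, 1.38, 1.45, 1.60, 1.55, 1.57]
pes = [0.044, 0.037, 0.036, 0.036, 0.046, 0.045, 0.047]

assumed = 1.40                          # assumed approximate mean
d = [s - assumed for s in seconds]      # only the d's are weighted
weights, d_mean, r0 = combine_with_pes(d, pes)
m0 = assumed + d_mean
print([round(w, 2) for w in weights], round(m0, 3), round(r0, 3))
```

Working with the differences d rather than the full values changes nothing in the result; it only keeps the products small, which is the labor-saving point of the assumed mean.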


Note. The reader is reminded that the expression M = M_0 ± r does not
mean that the true value of M is somewhere between M_0 + r and M_0 - r;
nor does it mean that M is probably in error by the amount r. It means
that, so far as accidental errors are concerned, the true value of M is just
as likely to lie between M_0 + r and M_0 - r as it is to lie outside of these
limits.


EXERCISES XVII

1. Ten measurements of equal precision were made to determine the
density of a body, the results of the measurements being as follows: 9.662,
9.673, 9.664, 9.659, 9.677, 9.662, 9.663, 9.680, 9.645, 9.654. Find the
probable error of a single measurement, the most probable value of the
density, and its p.e.

2. Twelve measurements of an angle in a primary triangulation gave
the following results. Find the p.e. of a single measurement, the most
probable value of the angle, and its p.e.

    116° 43' 44".45        116° 43' 51".75
             50 .95                 52 .35
             49 .20                 51 .05
             47 .40                 49 .05
             51 .05                 49 .25
             50 .60                 49 .25

3. Ten measurements of the coefficient of expansion of dry air gave
the following results. Find the most probable value of the coefficient and
its p.e.

    3.643 × 10^-3        3.636 × 10^-3
       64                   61
       44                   43
       60                   43
       53                   46




4. A certain coefficient of expansion was measured with different
apparatus with the following results. Find the best value for the coefficient
and its p.e.

    Measurement   Weight      Measurement   Weight
      0.0046         3          0.0036         2
      0.0039         2          0.0026         2
      0.0034         6          0.0027         1
      0.0030         4          0.0043         3

5. An angle was measured several times with a transit and then several
times with a theodolite, with the following results:

    Theodolite   36° 41' 28"   ± 11"
    Transit      36° 41' 23".8 ± 2".7

Find the most probable value of the angle and its p.e.

6. Six determinations of the velocity of light by different observers
at different times gave the following results, with their probable errors:

    298000 ± 1000
    298500 ± 1000
    299930 ± 100
    299990 ± 200
    300100 ± 1000
    299944 ± 50

Find the most probable value obtainable from these determinations and
its p.e.

7. Find the best value of the atomic weight of silver and its p.e. from
the following determinations:

    107.9401 ± 0.0058
    107.9406 ± 0.0049
    107.9233 ± 0.0140
    107.9371 ± 0.0045
    107.9270 ± 0.0090

II. INDIRECT MEASUREMENTS. 

161. The Probable Error of any Function of Independent Quantities
Whose P.E.'s are Known.

Let

(1)   $$ Q = f(q_1, q_2, q_3, \cdots q_n) $$

represent any function of directly measured quantities $q_1, q_2, \cdots q_n$.
Then errors $\Delta q_1, \Delta q_2, \cdots \Delta q_n$ in the q's will cause an error $\Delta Q$ in the
function Q, so that

$$ Q + \Delta Q = f(q_1 + \Delta q_1, q_2 + \Delta q_2, \cdots q_n + \Delta q_n). $$

Expanding the right-hand member by Taylor's theorem and proceeding
exactly as in Art. 6, we get

(2)   $$ \Delta Q = \frac{\partial Q}{\partial q_1}\Delta q_1 + \frac{\partial Q}{\partial q_2}\Delta q_2 + \cdots + \frac{\partial Q}{\partial q_n}\Delta q_n. $$
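This expression for $\Delta Q$ holds for any kind of errors whatever. If
$\Delta q_1, \Delta q_2, \cdots \Delta q_n$ are accidental errors, so that they obey the Normal
Law of error, then $\Delta Q$ is likewise an accidental error which obeys the
Normal Law, as proved in Art. 144. In this case equation (2) is exactly
like equation (2) of Art. 144, and all the results of that article apply to
it. Hence if $H, h_1, h_2, \cdots h_n$ denote the precision indices of $Q, q_1,
q_2, \cdots q_n$, respectively, we have from (144.2)

(3)   $$ \frac{1}{H^2} = \left(\frac{\partial Q}{\partial q_1}\right)^2\frac{1}{h_1^2} + \left(\frac{\partial Q}{\partial q_2}\right)^2\frac{1}{h_2^2} + \cdots + \left(\frac{\partial Q}{\partial q_n}\right)^2\frac{1}{h_n^2}. $$

Let us denote the probable errors of $Q, q_1, q_2, \cdots q_n$ by $R, r_1, r_2, \cdots r_n$,
respectively. Then from the relation $\rho = hr$ found in Art. 156 we have

$$ \frac{1}{H^2} = \frac{R^2}{\rho^2}, \quad \frac{1}{h_1^2} = \frac{r_1^2}{\rho^2}, \quad \frac{1}{h_2^2} = \frac{r_2^2}{\rho^2}, \quad \cdots \quad \frac{1}{h_n^2} = \frac{r_n^2}{\rho^2}, \quad \text{where } \rho = 0.4769. $$

Substituting these values of $1/H^2$, $1/h_1^2$, etc. in (3) and reducing, we get

(161.1)   $$ R = \sqrt{\left(\frac{\partial Q}{\partial q_1}\right)^2 r_1^2 + \left(\frac{\partial Q}{\partial q_2}\right)^2 r_2^2 + \cdots + \left(\frac{\partial Q}{\partial q_n}\right)^2 r_n^2}. $$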




This formula is of great importance, for it includes all possible cases 
of a function of directly measured quantities. It expresses the law of 
the propagation of errors and is the foundation of the whole subject 
of indirect measurements. 
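The law of propagation of errors lends itself to a direct numerical check. The sketch below is illustrative only: the function name `propagated_pe`, the central-difference estimate of the partial derivatives, and the rectangle data are all assumptions, not part of the text.

```python
import math

def propagated_pe(f, values, pes, rel_step=1e-6):
    """Probable error of Q = f(*values) by the law of propagation of errors.

    Each partial derivative dQ/dq_i is estimated by a central difference,
    so f is assumed to be smooth near the measured values.
    """
    total = 0.0
    for i, (q, r) in enumerate(zip(values, pes)):
        h = rel_step * (abs(q) if q != 0 else 1.0)
        up = list(values)
        down = list(values)
        up[i] = q + h
        down[i] = q - h
        dQ_dq = (f(*up) - f(*down)) / (2.0 * h)
        total += (dQ_dq * r) ** 2       # one term under the radical
    return math.sqrt(total)

# Hypothetical data: a rectangle with sides 2 and 3 whose p.e.'s are 0.1 and 0.2.
R = propagated_pe(lambda x, y: x * y, [2.0, 3.0], [0.1, 0.2])
print(round(R, 6))
```

For the product xy the partial derivatives are y and x, so the terms under the radical are (3 × 0.1)^2 and (2 × 0.2)^2.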

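The terms relative error and percentage error may also be applied to
probable errors. The fundamental formula for the relative error in indirect
measurements is obtained by dividing (161.1) throughout by Q. We
then have

(161.2)   $$ \frac{R}{Q} = \sqrt{\left(\frac{\partial Q}{\partial q_1}\frac{r_1}{Q}\right)^2 + \left(\frac{\partial Q}{\partial q_2}\frac{r_2}{Q}\right)^2 + \cdots + \left(\frac{\partial Q}{\partial q_n}\frac{r_n}{Q}\right)^2} $$

for the probable relative error. The probable percentage error is 100
times this.

Formula (161.2) assumes a very simple form when Q happens to
be a product of several functions or a logarithm of a single function.
Suppose, for instance, that

(161.3)   $$ Q = Kx^m y^n z^p. $$

Then

$$ \frac{\partial Q}{\partial x} = \frac{mQ}{x}, \quad \frac{\partial Q}{\partial y} = \frac{nQ}{y}, \quad \frac{\partial Q}{\partial z} = \frac{pQ}{z}, $$

and when these are substituted in (161.2) we get

(161.4)   $$ \frac{R}{Q} = \sqrt{m^2\left(\frac{r_x}{x}\right)^2 + n^2\left(\frac{r_y}{y}\right)^2 + p^2\left(\frac{r_z}{z}\right)^2} $$

for the probable relative error of Q.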

It is worth while to notice here that the p.e. of the weighted mean
of several sets of measurements whose p.e.'s are given (Art. 160) can
be found by the methods of the present article; for the weighted mean
may be written in the form

$$ M_0 = \frac{w_1M_1 + w_2M_2 + \cdots + w_nM_n}{\sum w}, $$

which is a linear function of the M's. Hence on substituting in (161.1)
the partial derivatives $\partial M_0/\partial M_1 = w_1/\sum w$, $\partial M_0/\partial M_2 = w_2/\sum w$, etc., we get

(4)   $$ R = \sqrt{\frac{w_1^2}{(\sum w)^2}r_1^2 + \frac{w_2^2}{(\sum w)^2}r_2^2 + \cdots + \frac{w_n^2}{(\sum w)^2}r_n^2}. $$

But since $r_1^2 = c/w_1$, $r_2^2 = c/w_2$, etc., we have

$$ R = \sqrt{\frac{w_1^2}{(\sum w)^2}\frac{c}{w_1} + \frac{w_2^2}{(\sum w)^2}\frac{c}{w_2} + \cdots + \frac{w_n^2}{(\sum w)^2}\frac{c}{w_n}} = \frac{\sqrt{c}}{\sum w}\sqrt{w_1 + w_2 + \cdots + w_n} = \frac{\sqrt{c}}{\sqrt{\sum w}}. $$

Now if we take $w_i = 1$ (i = 1, 2, ... n), we get $\sqrt{c} = r_i$ and therefore

$$ R = \frac{r_i}{\sqrt{\sum w}}, $$

which is formula (158.6).

On putting $w_1 = w_2 = \cdots = w_n = 1$ we get $r_1 = r_2 = \cdots = r_n = r$ by (158.2).
Then (4) reduces to

$$ R = \frac{r}{\sqrt{n}}, $$

which is formula (158.3).


162. The Two Fundamental Problems of Indirect Measurements. The
two main problems of indirect measurements are the following:

1. Given the p.e.'s of a number of directly measured quantities, to find
the p.e. of any function of these quantities.

2. Given a prescribed p.e. of the function, to find the allowable p.e.'s
of the directly measured quantities.

The first of these problems is solved by substituting the data directly
into formula (161.1) or (161.2), according as the given p.e.'s are absolute
or relative.

The second problem is mathematically indeterminate when the number
of directly observed quantities is greater than one. For a function of a
single quantity, say

$$ Q = f(x), $$

we have by (139.1)

$$ r = \frac{R}{\dfrac{dQ}{dx}}. $$


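162a). The Method of Equal Effects. If, on the other hand, Q is a
function of several directly measured quantities, we obtain a definite
solution by using the method of equal effects, as explained in Art. 10.
This method assumes that all the components (directly measured
independent quantities) contribute the same amount to the resultant error in
Q. Under these conditions all the terms under the radical in (161.1)
are equal to one another, so that

$$ R = \sqrt{n}\,\frac{\partial Q}{\partial q_1}r_1 = \sqrt{n}\,\frac{\partial Q}{\partial q_2}r_2 = \cdots = \sqrt{n}\,\frac{\partial Q}{\partial q_n}r_n. $$

Hence

(1)   $$ r_1 = \frac{R}{\sqrt{n}\,\dfrac{\partial Q}{\partial q_1}}, \quad r_2 = \frac{R}{\sqrt{n}\,\dfrac{\partial Q}{\partial q_2}}, \quad \cdots \quad r_n = \frac{R}{\sqrt{n}\,\dfrac{\partial Q}{\partial q_n}}. $$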
In some problems the p.e.'s of some of the components are so small
in comparison with the others that we may neglect them entirely when
applying the method of equal effects, thereby simplifying the problem.
Thus, if we wished to find the local time at any place on the earth's
surface, we could compute it from the formula

$$ \cos t = \frac{\sin h}{\cos L \cos d} - \tan L \tan d $$

as soon as we knew the altitude (h) and declination (d) of a heavenly
body and the latitude (L) of the place. The declination can be found
from the Nautical Almanac to a hundredth part of a second of arc, but
the altitude and latitude have to be measured at the place where the local
time is wanted. If these are measured with a sextant or an engineers'
transit, they can not be measured much closer than to the nearest minute
of arc. Hence the declination is known so much more accurately than
the altitude and latitude can be measured that we may treat the declination
as free from error, so that the error in t will be due entirely to the errors
in h and L. If, therefore, we desired the local time to the nearest
second, we would treat t as a function of h and L alone, take n = 2, and
find the allowable p.e.'s of h and L by means of formulas (1).

To find out whether the error in any particular component has a
negligible effect in producing an error in the function Q we apply the
following criterion:

162b). Criterion for Negligible Effects: If any component $q_k$ has a
negligible effect in causing an error in Q, then we must have *

(2)   $$ \frac{\partial Q}{\partial q_k}r_k \le \frac{R}{3}, $$

where R is the stipulated p.e. of Q. If several components $q_1, q_2, \cdots q_m$
should each satisfy (2), they may all be neglected provided

(3)   $$ \sqrt{\left(\frac{\partial Q}{\partial q_1}r_1\right)^2 + \left(\frac{\partial Q}{\partial q_2}r_2\right)^2 + \cdots + \left(\frac{\partial Q}{\partial q_m}r_m\right)^2} \le \frac{R}{3}. $$

When applying the criteria (2) and (3) to any particular problem,
we are supposed to know in advance the size of the p.e.'s of the components
we contemplate neglecting, as in the case of the declination d in the
astronomical problem mentioned above. If we know nothing concerning
the size of the p.e.'s whose effect we contemplate neglecting, then the
best we can do is to apply the method of equal effects to the terms under
the radical in (3), thereby obtaining

$$ \sqrt{m}\,\frac{\partial Q}{\partial q_1}r_1 = \sqrt{m}\,\frac{\partial Q}{\partial q_2}r_2 = \cdots = \sqrt{m}\,\frac{\partial Q}{\partial q_m}r_m \le \frac{R}{3}, $$

from which

$$ r_1 \le \frac{R}{3\sqrt{m}\,\dfrac{\partial Q}{\partial q_1}}, \quad r_2 \le \frac{R}{3\sqrt{m}\,\dfrac{\partial Q}{\partial q_2}}, \quad \cdots \quad r_m \le \frac{R}{3\sqrt{m}\,\dfrac{\partial Q}{\partial q_m}}. $$

We may therefore neglect the effect of m components $q_1, q_2, \cdots q_m$ if
each satisfies the condition

(4)   $$ \frac{\partial Q}{\partial q_k}r_k \le \frac{R}{3\sqrt{m}} \qquad (k = 1, 2, 3, \cdots m). $$

The proofs of criteria (2) and (3) are simple and easy, but they will
not be given here.*

* See Palmer's Theory of Measurements, p. 151.

We shall now apply the preceding formulas to some examples. 


Example 1. From the simple pendulum formula

$$ T = \pi\sqrt{\frac{l}{g}} $$

we get

$$ g = \frac{\pi^2 l}{T^2} = f(l, T). $$

If l = 100 cm. and T = 1 sec., find the error in g due to errors of 0.10 cm.
in l and 0.0020 sec. in T, respectively.

Solution. Differentiating g with respect to l and T separately, we have

$$ \frac{\partial g}{\partial l} = \frac{\pi^2}{T^2}, \qquad \frac{\partial g}{\partial T} = -\frac{2\pi^2 l}{T^3}. $$

From this point onward we proceed in one of two ways, depending on
the meaning of the errors in l and T.

(a) If the errors in l and T are actual, definite errors of the magnitudes
given, then we compute the error in g by the formula

$$ \Delta g = \frac{\partial g}{\partial l}\Delta l + \frac{\partial g}{\partial T}\Delta T. \qquad \text{[See (6.1)]} $$

Hence

$$ \Delta g = \frac{\pi^2 \Delta l}{T^2} - \frac{2\pi^2 l\,\Delta T}{T^3} $$

= 9.8696(0.10 + 200 × 0.002) = 4.935 cm./sec.^2 = 4.9, say.

Since we do not know the signs of ΔT and Δl, we disregard the negative
sign on the right and take the arithmetic sum of the terms. This gives
the maximum numerical value of Δg.

(b) If the given values of l and T are the means of several measurements
and their given errors are the p.e.'s of these arithmetic means, then we
compute the p.e. of g by formula (161.1). Hence we have

$$ R = \sqrt{\left(\frac{\pi^2}{T^2}\right)^2 r_l^2 + \left(\frac{2\pi^2 l}{T^3}\right)^2 r_T^2} = 9.8696\sqrt{0.01 + 0.16} $$

= 9.8696 × 0.4123 = 4.069 cm./sec.^2 = 4.1, say.

To find the relative and percentage errors under the two suppositions
(a) and (b), we have

(a)   $$ \text{Relative error} = \frac{\Delta g}{g} = \frac{\Delta l}{l} + \frac{2\,\Delta T}{T} = \frac{0.10}{100} + \frac{2(0.002)}{1} = 0.001 + 0.004 = 0.005. $$

Percentage error = 100(Δg/g) = 100 × 0.005 = 0.5 per cent.

(b) Since we are here dealing with a product of several quantities, we
use formula (161.4). Hence

$$ \frac{R}{g} = \sqrt{\left(\frac{r_l}{l}\right)^2 + 4\left(\frac{r_T}{T}\right)^2} = \sqrt{(0.001)^2 + 4(0.002)^2} $$

= 0.00412 = 0.004, say.

Percentage p.e. = 100(R/g) = 100 × 0.004 = 0.4 per cent.
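The two suppositions of Example 1 can be compared side by side in a few lines. This is a sketch under one stated assumption: the pendulum relation is taken as g = π²l/T² (T being the time of a single swing), which is the form implied by the factor 9.8696 in the worked figures. The variable names are hypothetical.

```python
import math

l, T = 100.0, 1.0          # cm, sec
err_l, err_T = 0.10, 0.0020

# Partial derivatives of the assumed relation g = pi^2 * l / T^2.
dg_dl = math.pi ** 2 / T ** 2
dg_dT = -2.0 * math.pi ** 2 * l / T ** 3
g = math.pi ** 2 * l / T ** 2

# (a) Definite errors of unknown sign: arithmetic sum of the terms.
max_error = abs(dg_dl) * err_l + abs(dg_dT) * err_T

# (b) Probable errors: root of the sum of the squared terms.
probable_error = math.hypot(dg_dl * err_l, dg_dT * err_T)

print(f"(a) {max_error:.3f}  relative {max_error / g:.4f}")
print(f"(b) {probable_error:.3f}  relative {probable_error / g:.5f}")
```

The arithmetic sum in (a) is always at least as large as the root-sum-square in (b), which is why the maximum error exceeds the probable error for the same data.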

Example 2. Two sides and the included angle of a triangle were
measured with the following results:

    a = 252.52 ± 0.06 feet,
    b = 300.01 ± 0.06 feet,
    C = 42°13'00" ± 30".

Find the area of the triangle and its p.e.

Solution. The formula for the area is

$$ A = \tfrac{1}{2}ab \sin C. $$

Hence

$$ \frac{\partial A}{\partial a} = \frac{b \sin C}{2}, \quad \frac{\partial A}{\partial b} = \frac{a \sin C}{2}, \quad \frac{\partial A}{\partial C} = \frac{ab \cos C}{2}. $$

Since the errors given in this problem are the probable errors of the
given measurements, we should use formula (161.1). The use of that
formula in this example, however, would call for a considerable amount
of numerical work. To avoid this we calculate the relative error by
formula (161.2) and then get the p.e. from the relative error. Hence
we have

$$ \frac{R}{A} = \sqrt{\left(\frac{r_a}{a}\right)^2 + \left(\frac{r_b}{b}\right)^2 + (r_C \cot C)^2}. $$

The error in C must be expressed in radians. Hence

$$ r_C = 30 \times \frac{\pi}{180} \times \frac{1}{3600} = 0.0001454. $$

Also, cot C = cot 42°13' = 1.1022.

$$ \therefore \frac{R}{A} = \sqrt{\left(\frac{0.06}{252.52}\right)^2 + \left(\frac{0.06}{300.01}\right)^2 + (0.0001454 \times 1.1022)^2} = 0.00035. $$

The area is

$$ A = \frac{252.52 \times 300.01 \times \sin 42°13'}{2} = 25452 \text{ sq. ft.} $$

∴ R = 0.00035 × A = 0.00035 × 25452 = 8.9 = 9, say.

The required result is therefore

A = 25452 ± 9 sq. ft.
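The figures of this example can be reproduced with the relative-error form of the propagation law. The sketch below is illustrative; it takes b = 300.01 ft, the value used in the area computation, and the variable names are hypothetical.

```python
import math

a, r_a = 252.52, 0.06                    # feet
b, r_b = 300.01, 0.06                    # feet
C = math.radians(42 + 13 / 60)           # 42 deg 13 min
r_C = math.radians(30 / 3600)            # 30 seconds of arc, in radians

area = 0.5 * a * b * math.sin(C)

# Relative p.e.: (dA/da)/A = 1/a, (dA/db)/A = 1/b, (dA/dC)/A = cot C.
relative_pe = math.sqrt((r_a / a) ** 2 + (r_b / b) ** 2
                        + (r_C / math.tan(C)) ** 2)
R = relative_pe * area

print(f"A = {area:.0f} sq. ft., R/A = {relative_pe:.5f}, R = {R:.1f}")
```

Converting the angular p.e. to radians before use matters here: left in seconds of arc, the third term would dominate the radical by several orders of magnitude.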


[Fig. 47]


Example 3. The distance between two inaccessible points A and B is
desired to ± 0.1 foot. The required distance can not be measured directly
but must be calculated from the measurements of CA, CB, and ∠ACB.
If a, b, and θ (see Fig. 47) are approximately equal to 200 ft., 150 ft.,
and 45°, respectively, find the allowable errors in these directly measured
quantities.





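Solution. Here

$$ c = \sqrt{a^2 + b^2 - 2ab \cos\theta}, $$

and

R = 0.1.

The best way to solve this problem is by the method of equal effects, and
we therefore use formulas (1). Differentiating c with respect to a, b,
and θ in turn, we have

$$ \frac{\partial c}{\partial a} = \frac{a - b\cos\theta}{c}, \quad \frac{\partial c}{\partial b} = \frac{b - a\cos\theta}{c}, \quad \frac{\partial c}{\partial \theta} = \frac{ab\sin\theta}{c}. $$

But

$$ c = \sqrt{40000 + 22500 - 30000\sqrt{2}} = 141.7. $$

$$ \therefore \frac{\partial c}{\partial a} = \frac{200 - 75\sqrt{2}}{141.7} = \frac{93.93}{141.7} = 0.66, $$

$$ \frac{\partial c}{\partial b} = \frac{150 - 100\sqrt{2}}{141.7} = \frac{8.58}{141.7} = 0.060, $$

$$ \frac{\partial c}{\partial \theta} = \frac{200 \times 150 \times \tfrac{1}{2}\sqrt{2}}{141.7} = \frac{21213}{141.7} = 149.7; $$

and n = 3.

Then by (1) we have

$$ r_a = \frac{0.1}{\sqrt{3} \times 0.66} = \frac{0.1732}{1.98} = 0.087 = 0.09 \text{ ft.}, $$

$$ r_b = \frac{0.1}{\sqrt{3} \times 0.06} = \frac{0.1732}{0.18} = 0.96 \text{ ft.}, $$

$$ r_\theta = \frac{0.1}{\sqrt{3} \times 149.7} = \frac{0.1732}{449.1} = 0.000386 \text{ rad.} = 1'20''. $$

The large allowable error in b is due to the fact that b is nearly
perpendicular to c, so that a considerable change in the former has little effect
on the latter.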
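The method of equal effects used in this example reduces to one line per component once the partial derivatives are known. A minimal sketch follows, with b taken as 150 ft (the value used in the worked computation); the function name `allowable_pes` is hypothetical.

```python
import math

def allowable_pes(R, partials):
    """Allowable p.e. of each component by the method of equal effects:
    r_i = R / (sqrt(n) * |dQ/dq_i|)."""
    n = len(partials)
    return [R / (math.sqrt(n) * abs(p)) for p in partials]

a, b, theta = 200.0, 150.0, math.radians(45.0)
c = math.sqrt(a * a + b * b - 2.0 * a * b * math.cos(theta))

partials = [
    (a - b * math.cos(theta)) / c,      # dc/da
    (b - a * math.cos(theta)) / c,      # dc/db
    a * b * math.sin(theta) / c,        # dc/dtheta (per radian)
]
r_a, r_b, r_theta = allowable_pes(0.1, partials)
seconds_of_arc = math.degrees(r_theta) * 3600.0

print(f"c = {c:.1f}, r_a = {r_a:.3f} ft, r_b = {r_b:.2f} ft, "
      f"r_theta = {r_theta:.6f} rad = {seconds_of_arc:.0f} sec of arc")
```

Because the partials are not rounded to two figures here, r_b comes out near 0.95 ft rather than 0.96 ft; the small derivative dc/db makes that component sensitive to rounding.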


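Example 4. The modulus of elasticity of a beam of length l, breadth
b, and depth d, supported at the ends and loaded at the center by a
weight W, is given by the formula

$$ E = \frac{Wl^3}{4ab\,d^3}, $$

where a is the deflection produced at the center. If it is desired to
measure E to 1 per cent, and the error in W may be neglected, compute
the allowable errors in a, b, d, and l.

Solution. The formula for E may be written

$$ E = \tfrac{1}{4}Wl^3a^{-1}b^{-1}d^{-3}. $$

This is of the form (161.3), where K = W/4. Then since R/E = 1%
= 0.01, we have from (161.4)

$$ 0.01 = \sqrt{9\left(\frac{r_l}{l}\right)^2 + \left(\frac{r_a}{a}\right)^2 + \left(\frac{r_b}{b}\right)^2 + 9\left(\frac{r_d}{d}\right)^2}. $$

Now using the method of equal effects, we have

$$ \sqrt{4 \times 9\left(\frac{r_l}{l}\right)^2} = 6\,\frac{r_l}{l} = 0.01. $$

$$ \therefore \frac{r_l}{l} = \frac{0.01}{6}, \quad \text{and} \quad 100\,\frac{r_l}{l} = \frac{1}{6} = 0.167 \text{ per cent.} $$

Likewise,

$$ \sqrt{4\left(\frac{r_a}{a}\right)^2} = 2\,\frac{r_a}{a} = 0.01. \qquad \therefore 100\,\frac{r_a}{a} = 0.5 \text{ per cent.} $$

$$ 100\,\frac{r_b}{b} = \tfrac{1}{2} \text{ per cent.} $$

$$ 100\,\frac{r_d}{d} = \tfrac{1}{6} = 0.167 \text{ per cent.} $$

Hence if the percentage p.e. of E is to be 1 per cent, the percentage
p.e.'s of a, b, d, l must not exceed 1/2, 1/2, 1/6, 1/6 of one per cent,
respectively.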

163. Rejection of Observations and Measurements. Occasionally some
individual measurement may differ so widely from the others of the
same set that we may suspect the discrepancy to be due to a mistake.
In such a case it may be well to reject this measurement entirely. To
decide what to do about it we apply the following rule:

Find the mean of all the measurements (including the "wild" one)
and find the residual for each. Compute the p.e. of a single measurement
by formula (159.1). Reject any measurement whose residual exceeds 5
times the p.e. of a single measurement.

This rule rests on the following considerations:

Suppose the chance of an error of magnitude x is 1 in 1000. Then
its probability is p = 1/1000 = 0.001. The chance that an error of this
size will not occur is therefore 1 - 0.001 = 0.999. From the probability
table we find the corresponding hx to be 2.326.

Now from Art. 156 we have

$$ hr = \rho = 0.4769, $$

from which

$$ h = \frac{0.4769}{r}. $$

Hence

$$ hx = \frac{0.4769}{r}\,x = 2.326, \qquad x = \frac{2.326}{0.4769}\,r = 4.9r, $$

to two figures.

The chance of making an error as great as five times the p.e. of a single
measurement is therefore less than one in a thousand. An error of such
a magnitude is therefore so improbable that we may safely neglect it.

Example. A quantity M was measured with the results given below.
Should any of the measurements be rejected?

M = 236, 251, 249, 252, 248, 254, 246, 257, 243, 274.

Solution. The average of these measurements is

M = 251.

Hence the residuals are

v_1 = +15, v_2 = 0, v_3 = +2, v_4 = -1, v_5 = +3, v_6 = -3,
v_7 = +5, v_8 = -6, v_9 = +8, v_10 = -23.

The p.e. of a single measurement is

$$ r = 0.6745\sqrt{\tfrac{1}{9}(225 + 4 + 1 + 9 + 9 + 25 + 36 + 64 + 529)} $$

= 0.6745 × 10.01 = 6.75.

Five times this p.e. is 33.75, and since all the residuals are less than
this we retain all the measurements.
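The rejection rule can be applied mechanically as sketched below. The function name `find_rejects` is hypothetical, and the ten readings are entered so that their mean is 251 and their p.e. reproduces the 6.75 of the example; the second data set is invented purely to show a case in which a reading is rejected.

```python
import math

def find_rejects(measurements, factor=5.0):
    """Return (rejected readings, p.e. of a single measurement).

    The mean and residuals include every reading, the suspected one too;
    a reading is rejected when |residual| exceeds factor times the p.e.
    """
    n = len(measurements)
    mean = sum(measurements) / n
    residuals = [mean - m for m in measurements]
    r = 0.6745 * math.sqrt(sum(v * v for v in residuals) / (n - 1))
    rejected = [m for m, v in zip(measurements, residuals)
                if abs(v) > factor * r]
    return rejected, r

readings = [236, 251, 249, 252, 248, 254, 246, 257, 243, 274]
rejected, r = find_rejects(readings)
print(rejected, round(r, 2))

# Hypothetical set: nineteen identical readings and one far from them.
wild, _ = find_rejects([0] * 19 + [20])
print(wild)
```

Note that a wild reading inflates the p.e. computed from the whole set, so with only a few measurements even a large discrepancy may fall inside five times the p.e.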


EXERCISES XVIII

1. The side b and the angles B and C of a plane triangle were measured
with the following results:

b = 106 ± 0.06 ft., B = 28°36' ± 1', C = 120°12' ± 1'.6.

Find the angle A, the side a, and their p.e.'s.

2. Two sides a and b and the included angle C of a town lot were
measured to be

a = 104.86 ± 0.02 ft., b = 214.24 ± 0.03 ft.,

C = 47°18' ± 1'.

Find the side c and its p.e.

3. The index of refraction of a prism is given by the formula

$$ n = \frac{\sin \tfrac{1}{2}(a + D)}{\sin \tfrac{1}{2}a}. $$

If D = 28°34' ± 0'.6 and a = 62°48' ± 0'.7, find n and its p.e.

4. The current in a tangent galvanometer is given by the formula

$$ I = K \tan\theta. $$

Find I and its p.e. when K = 1.963 ± 0.002 and θ = 35° ± 0°.1.

5. The volume of a right circular cylinder is given by the formula

$$ V = \frac{\pi}{4}d^2h. $$

Find V and its p.e. when h = 116.85 ± 0.28 mm. and d = 82.54 ± 0.28 mm.

6. The diameter of a polished steel rod was measured ten times with
the following results:

0.5003, 0.5002, 0.4999, 0.4998, 0.4999, 0.5003, 0.5001, 0.5004, 0.5001,
0.4999.

Find the cross-sectional area and its p.e.

7. Explain how you would decide in any given problem whether to
use formula (6.1) or formula (161.1). What is the fundamental difference
between these two formulas?



CHAPTER XVIII 


EMPIRICAL FORMULAS 

164. Introduction. An empirical formula, or empirical equation, is one
whose form is inferred from the results of experiment or observation and
in which the constants are determined from experimental or observational
data. Thus, it is known that the speed of a ship varies with the horse
power according to the formula

$$ P = a + bV^3. $$

The constants to be determined in this formula are a and b, and for the
purpose of determining them we should take several sets of readings of
the speed and corresponding horse power. These sets of simultaneous
values of V and P would, when substituted in the given formula, give
several equations in the two unknowns a and b. The next thing to be
done would be to find the best values for a and b from the several
equations. For the solution of this part of the problem three methods
are available: the graphic method or method of selected points, the method
of averages, and the method of Least Squares. We shall now consider
these methods in the order named and illustrate each by several examples.

165. The Graphic Method, or Method of Selected Points. This method
can be used whenever the given formula can be plotted as a straight line
either directly or after a suitable transformation. The equation given
above, for example, can be reduced to a straight-line form by putting
$V^3 = t$, thereby reducing the equation to the form

$$ P = a + bt, $$

which is linear in the variables t and P.

To apply the graphic method to this problem we plot on coordinate
paper the corresponding values of t (= V^3) and P. The plotted points
should lie nearly on a straight line. We then draw a straight line which
will be a good compromise for all the plotted points and pass as near
as possible to each of them. The slope of this line will be the value of
b and its P-intercept will be a. If the line happens to pass through two
of the plotted points, or through any other two points whose coordinates
are easily determined (points at the corners of squares, for instance),
we can substitute their coordinates in the given equation and solve the
two resulting equations for a and b, but the points so used should be as
far apart as possible. The drawing of the best representative straight
line is a matter of good judgment.

This method will give fairly good results when finely divided coordinate
paper is used, but in general it is not recommended except for obtaining
approximate values of the constants or in cases where the results obtainable
by the method are as accurate as the data used.

Example 1. The electrical resistance of a copper wire varies with the
temperature according to the equation

$$ R = a + bT. $$

For the purpose of determining the constants a and b the measurements
of temperature and corresponding resistance given in the following table
were made. Find the values of a and b.

    T |  19.1    25.0    30.1    36.0    40.0    45.1    50.0
    R | 76.30   77.80   79.75   80.80   82.35   83.90   85.10

[Fig. 48]

Solution. Plotting these pairs of values on a large sheet of paper and
drawing what seems to be a good compromise line (Fig. 48), we find that
this line passes through the points (21, 77) and (64, 89). Substituting
in the given equation the coordinates of these points, we have

    a + 21b = 77
    a + 64b = 89

    43b = 12,     b = 12/43 = 0.2790,

and

    a = 77 - 21 × 0.2790 = 71.14.

Hence the required relation between R and T is

    R = 71.14 + 0.2790T.

To see how well this formula fits the data in the table we compute the
residuals of the several measurements. Writing

    v = 0.2790T + 71.14 - R,

we have

    v_1 = 0.2790 × 19.1 + 71.14 - 76.30 = 0.15
    v_2 = 0.2790 × 25.0 + 71.14 - 77.80 = 0.32
    v_3 = -0.21
    v_4 = 0.39
    v_5 = -0.05
    v_6 = -0.19
    v_7 = -0.01

    ∴ Σv = 0.40, Σv^2 = 0.36.
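Once two points on the compromise line have been read from the plot, the rest of the method of selected points is plain arithmetic. The following is a hedged sketch; the function name `line_through` is hypothetical, and because the constants are not rounded to four figures the residuals it prints may differ in the last place from hand-computed values.

```python
def line_through(p1, p2):
    """Constants a, b of y = a + b*x for the line through two points."""
    (x1, y1), (x2, y2) = p1, p2
    b = (y2 - y1) / (x2 - x1)
    a = y1 - b * x1
    return a, b

# Points read from the compromise line of the example.
a, b = line_through((21, 77), (64, 89))

T = [19.1, 25.0, 30.1, 36.0, 40.0, 45.1, 50.0]
R = [76.30, 77.80, 79.75, 80.80, 82.35, 83.90, 85.10]

# Residual of each measurement: computed value minus observed value.
residuals = [a + b * t - r for t, r in zip(T, R)]
print(round(a, 2), round(b, 4))
print([round(v, 2) for v in residuals])
```

The choice of the two points is the only place where judgment enters; a different compromise line would give different constants and residuals.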

Example 2. The data in the following table fit a formula of the type

(1)   $$ y = ax^n. $$

Find the values of a and n and thence the required formula.

    x |  10     20     30     40     50     60     70     80
    y | 1.06   1.33   1.52   1.68   1.81   1.91   2.01   2.11

Solution. Taking the logarithm of each side of the given equation,
we have

(2)   $$ \log y = \log a + n \log x. $$

Putting

$$ y' = \log y, \qquad x' = \log x, $$

we get

$$ y' = \log a + nx' = a' + nx', $$

where $a' = \log a$.









This is the equation of a straight line in the new variables x' and y'.
To plot this line most conveniently we use logarithmic paper.
Plotting the given points on such paper, we find that they lie almost
exactly on a straight line (Fig. 49). Hence we substitute in (2) the
coordinates of the first and last of the given points and get

    log 1.06 = log a + n,
    log 2.11 = log a + n log 80;

or

    n + log a = 0.0253,
    1.9031n + log a = 0.3243.

    ∴ 0.9031n = 0.2990,
    n = 0.3311.

Also,

    log a = 0.0253 - n
          = 0.0253 - 0.3311
          = 9.6942 - 10.
    ∴ a = 0.4945.

[Fig. 49]

The required formula is therefore

$$ y = 0.4945x^{0.3311}. $$
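The logarithmic reduction used in this example can be sketched in a few lines. The function name `power_law_through` is hypothetical, and full-precision logarithms are used in place of four-place tables, so the constants may differ from the hand values in the last figure.

```python
import math

def power_law_through(p1, p2):
    """Constants a, n of y = a * x**n through two points (x, y > 0)."""
    (x1, y1), (x2, y2) = p1, p2
    # Slope of the straight line in the variables log x, log y.
    n = (math.log10(y2) - math.log10(y1)) / (math.log10(x2) - math.log10(x1))
    a = y1 / x1 ** n
    return a, n

# First and last points of the table.
a, n = power_law_through((10, 1.06), (80, 2.11))
print(round(a, 4), round(n, 4))

# Check of the fitted formula against an intermediate tabulated point.
print(round(a * 40 ** n, 2))
```

Using only the first and last points is reasonable here because the plotted points lie almost exactly on a line; with scattered data a compromise line should be drawn first.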

Example 3. Find a formula of the form

(3)   $$ y = ke^{mx} $$

which will fit the data in the table below.

    x |   1      2      3      4      5      6      7       8
    y | 15.3   20.5   27.4   36.6   49.1   65.6   87.8   117.6

Solution. Taking the common logarithm of each side of the given
equation, we have

(4)   $$ \log y = \log k + mx \log e = \log k + (m \log e)x, $$

or

$$ y' = \log k + (m \log e)x, \quad \text{where } y' = \log y. $$

This is the equation of a straight line in the variables x and y'. To plot
it we use semilogarithmic paper. Plotting the given values of x and y
on semilogarithmic paper, we find that the points lie nearly on a straight
line (Fig. 50). Drawing on a large sheet of paper what seems to be a
good representative line, we notice that it passes through the points (0.4,
13) and (8.6, 140). Substituting these values in (4), we have

    log 13 = log k + 0.43429m(0.4) = log k + 0.1737m,
    log 140 = log k + 0.43429m(8.6) = log k + 3.7349m.

[Fig. 50]

Solving these equations for m and k, we get

    m = 0.2898,
    k = 11.58.

The required equation is therefore

$$ y = 11.58e^{0.2898x}. $$
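The same two-point reduction works for the exponential form, with only y taken on a logarithmic scale. This is a sketch with a hypothetical function name, `exponential_through`; natural logarithms are used directly, which is equivalent to dividing the common logarithms by log e = 0.43429.

```python
import math

def exponential_through(p1, p2):
    """Constants k, m of y = k * exp(m*x) through two points (y > 0)."""
    (x1, y1), (x2, y2) = p1, p2
    m = (math.log(y2) - math.log(y1)) / (x2 - x1)
    k = y1 * math.exp(-m * x1)
    return k, m

# Points read from the representative line of the example.
k, m = exponential_through((0.4, 13), (8.6, 140))
print(round(k, 2), round(m, 4))

# Comparison with a tabulated value, y = 49.1 at x = 5.
print(round(k * math.exp(m * 5), 1))
```

The fitted curve need not pass through any tabulated point, since the two points were read from the drawn line rather than taken from the data.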


Note. In logarithmic coordinate paper the origin is the point (1, 1).
Hence the equations of the axes are x = 1, y = 1. Putting x = 1 in the
equation $y = ax^n$, we get y = a. Hence in the straight-line graph of the
equation $y = ax^n$ on logarithmic paper the constant a is the y-intercept.

To find a formula for the exponent n, let $(x_1, y_1)$ and $(x_2, y_2)$ be any
two pairs of corresponding values of x and y. Then from (2)

\[ \log y_2 = \log a + n \log x_2, \]
\[ \log y_1 = \log a + n \log x_1. \]

\[ \therefore\ \log y_2 - \log y_1 = n(\log x_2 - \log x_1), \]

or

\[ n = \frac{\log y_2 - \log y_1}{\log x_2 - \log x_1}. \tag{5} \]


The origin of coordinates in semilogarithmic paper is the point (0, 1).
The equation of the y-axis is therefore x = 0, and that of the x-axis is
y = 1. Putting x = 0 in (3), we get y = k. Hence in the straight-line
graph of the equation $y = ke^{mx}$ on semilogarithmic paper the constant k
is the y-intercept.

To find a formula for the exponent m we substitute in (4) two pairs
of corresponding values of x and y, obtaining the two equations

\[ \log y_2 = \log k + (m \log e)\,x_2, \]
\[ \log y_1 = \log k + (m \log e)\,x_1. \]

\[ \therefore\ \log y_2 - \log y_1 = (x_2 - x_1)\,m \log e, \]

or

\[ m = \frac{\log y_2 - \log y_1}{(x_2 - x_1)\log e}
     = 2.3026\,\frac{\log y_2 - \log y_1}{x_2 - x_1}. \tag{6} \]
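Formulas (5) and (6) can be checked numerically. The following is a minimal sketch, not part of the original text; the function names are illustrative only. It applies (5) to the two points (10, 1.06) and (80, 2.11) of Example 2 and (6) to the two points (0.4, 13) and (8.6, 140) read from the line of Example 3.

```python
import math

def power_law_from_two_points(p1, p2):
    """Fit y = a * x**n through two points, using formula (5) for n."""
    (x1, y1), (x2, y2) = p1, p2
    n = (math.log10(y2) - math.log10(y1)) / (math.log10(x2) - math.log10(x1))
    a = y1 / x1 ** n            # from log y1 = log a + n log x1
    return a, n

def exponential_from_two_points(p1, p2):
    """Fit y = k * exp(m*x) through two points, using formula (6) for m."""
    (x1, y1), (x2, y2) = p1, p2
    # The factor 2.3026 in (6) is 1 / log10(e), that is, ln 10.
    m = math.log(10) * (math.log10(y2) - math.log10(y1)) / (x2 - x1)
    k = y1 / math.exp(m * x1)   # from log y1 = log k + (m log e) x1
    return k, m

a, n = power_law_from_two_points((10, 1.06), (80, 2.11))
k, m = exponential_from_two_points((0.4, 13), (8.6, 140))
print(f"y = {a:.4f} x^{n:.4f}")
print(f"y = {k:.2f} e^({m:.4f} x)")
```

The printed constants should agree with the hand computations above to within the rounding of the four-place logarithms.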





If the given points are so plotted that the equations of the axes are
not as stated above, the y-intercept will not be the value of the constant
a or k. For instance, in Example 2 we plotted the point (10, 1.06) on the
y-axis. This is really equivalent to making the substitution x = 10x',
so that the given equation is transformed into the equivalent equation

\[ y = a(10x')^n = a \times 10^n x'^n. \]

Putting x' = 1, we get $y = a \times 10^n = 0.4945 \times 10^{0.3311} = 1.06$, and this
is the actual plotted value of the y-intercept. The student should have no
difficulty in deciding whether or not the y-intercept of the plotted straight
line gives the true value of the coefficients a and k in any given example.

166. The Method of Averages. The residuals of a series of plotted
points are the vertical distances of these points from the best representative 
curve. Some of the residuals will be positive and others negative. The 
method of averages assumes that the best representative curve is that for 
which the algebraic sum of the residuals is zero. To find the unknown 
constants in an empirical formula by this method we first substitute in 
the given formula the several pairs of observed or measured values of x 
and y. We thus get as many residuals as there are pairs of observed 
values. Then we divide the residuals, or residual equations, into as many 
groups as there are constants in the assumed formula. Each group should 
contain as nearly as possible the same number of residuals. By placing 
the sum of the residuals in the first group equal to zero we get a single 
equation in the unknown constants. Placing the sum of the residuals in 
the second group equal to zero, we get a second equation in the constants, 
and so on. Since the sum of the residuals in each group is zero, the sum 
of all the residuals is necessarily zero. On solving simultaneously the 
equations obtained from the several groups, we obtain the values of the 
unknown constants in the original formula. A few examples will make 
the method clear. 

Example 1. The data in the following table will fit a formula of the
type

\[ y = a + bx + cx^2. \tag{1} \]

Find the formula.

    x |  87.5   84.0   77.8   63.7   46.7   36.9
    y |  292    283    270    235    197    181





Solution. Substituting in (1) the several pairs of corresponding values
of x and y, we get

\[
\left\{
\begin{aligned}
v_1 &= a + 87.5b + 7656c - 292 \\
v_2 &= a + 84.0b + 7056c - 283
\end{aligned}
\right.
\]
\[
\left\{
\begin{aligned}
v_3 &= a + 77.8b + 6053c - 270 \\
v_4 &= a + 63.7b + 4058c - 235
\end{aligned}
\right.
\qquad \text{Residual equations.}
\]
\[
\left\{
\begin{aligned}
v_5 &= a + 46.7b + 2181c - 197 \\
v_6 &= a + 36.9b + 1362c - 181
\end{aligned}
\right.
\]

Dividing these equations into three groups (since there are three
constants to be determined), as indicated by the braces at the left, adding
the equations of each group, and placing the sums equal to zero, we get
the three equations

\[
\begin{aligned}
2a + 171.5b + 14712c &= 575, \\
2a + 141.5b + 10111c &= 505, \\
2a + 83.6b + 3543c &= 378.
\end{aligned}
\]

Solving these three equations simultaneously for a, b, and c, we get

\[ a = 107.72, \qquad b = 1.7960, \qquad c = 0.0035036. \]

Hence the required formula is

\[ y = 107.72 + 1.7960x + 0.0035036x^2. \]

This method of averages requires no graph and can be applied to any
formula which is linear (of the first degree) in the unknown constants
or to any formula which is reducible to a form linear in the constants.
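The grouping procedure of Example 1 can be sketched in a few lines. This is an illustration only, under the assumption that the residual equations are taken in consecutive groups; the helper names `solve_linear` and `method_of_averages` are not from the text. Because the squares of x are carried in full here rather than rounded to four figures, the constants may differ slightly in the last figure from those found above.

```python
def solve_linear(A, rhs):
    """Solve A z = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    M = [list(map(float, row)) + [float(r)] for row, r in zip(A, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    z = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(M[r][c] * z[c] for c in range(r + 1, n))
        z[r] = (M[r][n] - s) / M[r][r]
    return z

def method_of_averages(rows, ys, n_groups):
    """rows[i] holds the coefficients of the unknowns in the i-th residual
    equation. The equations are split into n_groups consecutive groups and
    the sum of the residuals in each group is set equal to zero."""
    size, extra = divmod(len(rows), n_groups)
    A, rhs, start = [], [], 0
    for g in range(n_groups):
        stop = start + size + (1 if g < extra else 0)
        A.append([sum(col) for col in zip(*rows[start:stop])])
        rhs.append(sum(ys[start:stop]))
        start = stop
    return solve_linear(A, rhs)

xs = [87.5, 84.0, 77.8, 63.7, 46.7, 36.9]
ys = [292, 283, 270, 235, 197, 181]
rows = [[1.0, x, x * x] for x in xs]      # y = a + b*x + c*x^2
a, b, c = method_of_averages(rows, ys, n_groups=3)
print(f"y = {a:.2f} + {b:.4f} x + {c:.7f} x^2")
```

The number of groups must equal the number of unknown constants, so that the group sums give exactly as many equations as unknowns.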

Example 2. Solve Example 2, Art. 165, by the method of averages.

Solution. Strictly speaking, the residuals are, by definition,

\[ v_1 = a x_1^n - y_1, \qquad v_2 = a x_2^n - y_2, \quad \text{etc.} \]

But if we divide these equations into groups, add, and attempt to solve
the resulting equations for a and n, we get into trouble at once; for the
unknown n occurs as an exponent in several terms of a sum.

We can avoid this trouble without much loss in accuracy by proceeding
as follows: Instead of equating to zero the sum of the residuals of the
y's, we equate to zero the sum of the residuals of the logarithms of the y's.
For any residual we have from (2) of Art. 165

\[ v' = \log a + n \log x - \log y. \]





Hence the several residuals are

\[
\left\{
\begin{aligned}
v'_1 &= \log a + 1.0000n - 0.0253 \\
v'_2 &= \log a + 1.3010n - 0.1239 \\
v'_3 &= \log a + 1.4771n - 0.1818 \\
v'_4 &= \log a + 1.6021n - 0.2253
\end{aligned}
\right.
\]
\[
\left\{
\begin{aligned}
v'_5 &= \log a + 1.6990n - 0.2577 \\
v'_6 &= \log a + 1.7782n - 0.2810 \\
v'_7 &= \log a + 1.8451n - 0.3032 \\
v'_8 &= \log a + 1.9031n - 0.3243.
\end{aligned}
\right.
\]

In actual practice we do not write down these equations in this form,
but in the form given below:

\[
\begin{array}{ll}
\log a + 1.0000n = 0.0253 & \log a + 1.6990n = 0.2577 \\
\log a + 1.3010n = 0.1239 & \log a + 1.7782n = 0.2810 \\
\log a + 1.4771n = 0.1818 & \log a + 1.8451n = 0.3032 \\
\log a + 1.6021n = 0.2253 & \log a + 1.9031n = 0.3243 \\
\hline
(2)\quad 4\log a + 5.3802n = 0.5563. & (3)\quad 4\log a + 7.2254n = 1.1662.
\end{array}
\]

Solving (2) and (3) simultaneously, we get

\[ n = 0.3305, \qquad \log a = -0.3055 = 9.6945 - 10, \qquad \therefore\ a = 0.4949. \]

The required formula is therefore

\[ y = 0.4949\,x^{0.3305}. \]
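As a check on equations (2) and (3), the sketch below repeats the computation with the four-place logarithms listed in the residual equations. It is an illustration only; the variable names are not from the text.

```python
# Four-place logarithms of x and y, as they appear in the residual equations.
log_x = [1.0000, 1.3010, 1.4771, 1.6021, 1.6990, 1.7782, 1.8451, 1.9031]
log_y = [0.0253, 0.1239, 0.1818, 0.2253, 0.2577, 0.2810, 0.3032, 0.3243]

half = len(log_x) // 2
# Adding the equations of each group gives  N*log a + (sum of log x)*n = sum of log y.
n1, sx1, sy1 = half, sum(log_x[:half]), sum(log_y[:half])
n2, sx2, sy2 = len(log_x) - half, sum(log_x[half:]), sum(log_y[half:])

# Solve the two group equations for log a and n by determinants.
det = n1 * sx2 - n2 * sx1
n = (n1 * sy2 - n2 * sy1) / det
log_a = (sy1 * sx2 - sy2 * sx1) / det
a = 10 ** log_a

print(f"n = {n:.4f}, log a = {log_a:.4f}, a = {a:.4f}")
```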

Note. The method of averages is the shortest and easiest method for
finding the constants in an empirical formula, but it must not be used
blindly. The residual equations can be grouped in several ways,* and
each different grouping will give different values for the unknown
constants, even though the algebraic sum of the residuals be zero in every
case. The resulting formulas will thus be different, and some of them
will fit the data much better than the others.

* The number of possible groupings is given by the following formulas:

a) Two groups. The number of different ways in which p + q different things can
be divided into two groups of p things and q things, respectively, is

\[ \frac{(p + q)!}{p!\,q!}. \]

b) Three groups. The number of different ways in which p + q + r different
things can be divided into three groups of p things, q things, and r things,
respectively, is

\[ \frac{(p + q + r)!}{p!\,q!\,r!}. \]

c) Four or more groups. The number of ways in which we can divide p + q + r + s
different things into four groups of p things, q things, r things, and s things,
respectively, is

\[ \frac{(p + q + r + s)!}{p!\,q!\,r!\,s!}. \]

And so on for any other case. For the proof of these formulas see Wentworth's
College Algebra, pp. 263-264; or Whitworth's Choice and Chance, pp. 63-64.

There is no way to determine in advance just what grouping will give 
the best result. As a general rule the best formula is obtained by grouping 
the residual equations in consecutive order, as was done in Examples 1 
and 2. The following example will serve to clear up the matter of 
grouping. 

Example 3. Find by the method of averages a formula of the type

\[ y = a + bx^3 \]

which will fit the following data:

    x |   5     7      9      11     12
    y |  290   560   1044   1810   2300


Solution. The residual equations are

\[
\begin{aligned}
v_1 &= a + 125b - 290 \\
v_2 &= a + 343b - 560 \\
v_3 &= a + 729b - 1044 \\
v_4 &= a + 1331b - 1810 \\
v_5 &= a + 1728b - 2300.
\end{aligned}
\]

The number of possible groupings of these equations is 5!/(3!2!) = 10.
The ten different groupings and the resulting formulas corresponding to
them are given below.


 No.  Groups                    Formula                      $\sum v$       $\sum v^2$

  1.  (v1, v2), (v3, v4, v5)    $y = 130.87 + 1.2570x^3$      0.000010        64.066
  2.  (v1, v2, v3), (v4, v5)    $y = 128.86 + 1.2593x^3$      0.000071        72.600
  3.  (v1, v3), (v2, v4, v5)    $y = 129.68 + 1.2584x^3$      0.00028         66.668
  4.  (v1, v4), (v2, v3, v5)    $y = 158.91 + 1.2240x^3$     -0.000017      2038.694
  5.  (v1, v5), (v2, v3, v4)    $y = 135.95 + 1.2510x^3$     -0.000004       132.197
  6.  (v2, v4), (v1, v3, v5)    $y = 253.69 + 1.1127x^3$     -0.000001     37626.548
  7.  (v3, v5), (v1, v2, v4)    $y = 137.76 + 1.2489x^3$     -0.000002       187.828
  8.  (v3, v4), (v1, v2, v5)    $y = 123.96 + 1.2651x^3$      0.0000001      177.705
  9.  (v2, v3), (v1, v4, v5)    $y = 123.84 + 1.2652x^3$     -0.000002       181.389
 10.  (v2, v5), (v1, v3, v4)    $y = 142.23 + 1.2436x^3$     -0.000001       393.303


The best formulas are those for which $\sum v^2$ is least and are evidently
1, 2, 3. The poorest are 4 and 6.

The best formula obtainable is found by the method of Least Squares
to be

\[ y = 130.71 + 1.2572x^3, \]

for which $\sum v = -0.0000016$ and $\sum v^2 = 64.004$.

A carefully constructed graph, obtained by putting $x^3 = u$ and plotting
the straight line y = a + bu on a large sheet of finely squared paper, gave

\[ y = 125 + 1.33x^3, \]

for which $\sum v = 281.48$, $\sum v^2 = 25460.154$. This formula obtained from
a good graph is far inferior to nine of the ten formulas obtained by the
method of averages.
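The dependence of the result on the grouping can be explored by enumerating every grouping mechanically. The sketch below is an illustration only (the function names are not from the text): it forms each of the ten groupings of Example 3, solves the two group equations, and lists the formulas in order of increasing sum of squared residuals.

```python
from itertools import combinations
from math import factorial

xs = [5, 7, 9, 11, 12]
ys = [290, 560, 1044, 1810, 2300]
us = [x ** 3 for x in xs]          # the formula is linear in u = x^3

def fit_by_grouping(group):
    """Method of averages for y = a + b*u with `group` as one group of
    residual equations and the remaining equations as the other group."""
    rest = [i for i in range(len(us)) if i not in group]
    n1, u1, y1 = len(group), sum(us[i] for i in group), sum(ys[i] for i in group)
    n2, u2, y2 = len(rest), sum(us[i] for i in rest), sum(ys[i] for i in rest)
    det = n1 * u2 - n2 * u1
    a = (y1 * u2 - y2 * u1) / det
    b = (n1 * y2 - n2 * y1) / det
    return a, b

def sum_sq_residuals(a, b):
    return sum((a + b * u - y) ** 2 for u, y in zip(us, ys))

# Number of ways of dividing 5 things into groups of 3 and 2.
n_groupings = factorial(5) // (factorial(3) * factorial(2))

results = []
for pair in combinations(range(5), 2):   # choosing the group of 2 fixes the group of 3
    a, b = fit_by_grouping(pair)
    results.append((sum_sq_residuals(a, b), pair, a, b))
results.sort()

for ssq, pair, a, b in results:
    labels = ", ".join(f"v{i + 1}" for i in pair)
    print(f"({labels}): y = {a:.2f} + {b:.4f} x^3, sum v^2 = {ssq:.3f}")
```

Sorting by the sum of squares makes the spread between the best and the poorest groupings visible at a glance.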

When the number of residual equations is large enough to allow three 
or more to each group, the method of averages can be depended upon 
to give good results. If we have only a few sets of data (readings or 
measurements) and can not easily obtain more, we should always use the 
method of Least Squares. This method gives only one formula and that 
is always the best possible one. 

Every empirical formula, however obtained, should always be tested by 
computing the residuals and seeing whether they are within allowable
limits. 


167. The Method of Least Squares. This method says that the best
representative curve is that for which the sum of the squares of the
residuals is a minimum. Since the squares of the residuals are positive
quantities, the requirement that their sum shall be as small as possible
ensures that the numerical values of the residuals will be small; and this
means that in the case of a series of plotted points the best representative
curve will pass as closely as possible to all the points. Before applying
this method to empirical formulas we shall first derive a fundamental
rule which reduces the method to a simple procedure.

For simplicity let us consider the formula

\[ y = a + bx + cx^2 \tag{1} \]

and find the values of a, b, and c which will make the graph of (1) pass as
near as possible to each of the n points $(x_1, y_1), (x_2, y_2), \ldots, (x_n, y_n)$;
or, stated otherwise, let us find an equation of the form (1) which will
be satisfied as nearly as possible by each of the n pairs of observed values
$(x_1, y_1), (x_2, y_2), \ldots, (x_n, y_n)$. The equation will not, in general, be
satisfied exactly by any of the n pairs. Substituting in (1) each of the n
pairs of values in turn, we get the following residual equations:

\[
\begin{aligned}
v_1 &= a + bx_1 + cx_1^2 - y_1, \\
v_2 &= a + bx_2 + cx_2^2 - y_2, \\
&\;\;\vdots \\
v_n &= a + bx_n + cx_n^2 - y_n.
\end{aligned}
\tag{2}
\]

The principle of least squares says that the best values of the unknown
constants a, b, and c are those which make the sum of the squares of the
residuals a minimum, or

\[ \sum v^2 = v_1^2 + v_2^2 + \cdots + v_n^2 \]

must be a minimum. Hence

\[
\sum (a + bx + cx^2 - y)^2 = (a + bx_1 + cx_1^2 - y_1)^2 + (a + bx_2 + cx_2^2 - y_2)^2
 + \cdots + (a + bx_n + cx_n^2 - y_n)^2 = f(a, b, c)
\]

is to be a minimum.

The condition that f(a, b, c) be a maximum or a minimum is that its
partial derivatives with respect to a, b, and c shall each be zero. We
therefore have

\[
\begin{aligned}
\frac{\partial f}{\partial a} &= 2(a + bx_1 + cx_1^2 - y_1) + 2(a + bx_2 + cx_2^2 - y_2) + \cdots = 0, \\
\frac{\partial f}{\partial b} &= 2(a + bx_1 + cx_1^2 - y_1)x_1 + 2(a + bx_2 + cx_2^2 - y_2)x_2 + \cdots = 0, \\
\frac{\partial f}{\partial c} &= 2(a + bx_1 + cx_1^2 - y_1)x_1^2 + 2(a + bx_2 + cx_2^2 - y_2)x_2^2 + \cdots = 0.
\end{aligned}
\]



Dividing through by 2, we get the following three normal equations:

\[
\begin{aligned}
&(a + bx_1 + cx_1^2 - y_1) + (a + bx_2 + cx_2^2 - y_2) + \cdots + (a + bx_n + cx_n^2 - y_n) = 0, \\
&x_1(a + bx_1 + cx_1^2 - y_1) + x_2(a + bx_2 + cx_2^2 - y_2) + \cdots + x_n(a + bx_n + cx_n^2 - y_n) = 0, \\
&x_1^2(a + bx_1 + cx_1^2 - y_1) + x_2^2(a + bx_2 + cx_2^2 - y_2) + \cdots + x_n^2(a + bx_n + cx_n^2 - y_n) = 0.
\end{aligned}
\]

It will be observed that these normal equations can be written down 
immediately by applying the following 

Rule: To find the first normal equation multiply the right-hand member
of each residual equation by the coefficient of the first unknown in that
member, add the products thus obtained, and equate their sum to zero;
to get the second normal equation multiply the right-hand member of
each residual equation by the coefficient of the second unknown in that
member, add the products so obtained, and place their sum equal to zero;
and so on for the remaining normal equations.

The normal equations are solved by the ordinary methods of algebra 
for solving simultaneous equations of the first degree in two or more 
unknowns. When there are several equations and the coefficients contain 
several digits, solve by the methods of Arts. 96 or 102. 

The number of normal equations is always the same as the number of 
unknown constants to be determined, whereas the number of residual 
equations is equal to the number of observations. The number of observa- 
tions must always be greater than the number of undetermined constants 
if the method of least squares is to be of any benefit in the solution. 

The rule above is applicable to any formula which is linear in the
constants or to any formula which can be reduced to a form linear in 
the constants. 

Example 1. Find the equation of the straight line which comes nearest
to passing through the following points:

    x |  0.5    1.0    1.5    2.0    2.5    3.0
    y |  0.31   0.82   1.29   1.85   2.51   3.02




Solution. Let the equation of the line be

\[ y = a + bx. \]

Substituting in this equation the several pairs of values of x and y, we
get the following residual equations:

\[
\begin{aligned}
v_1 &= a + 0.5b - 0.31 \\
v_2 &= a + b - 0.82 \\
v_3 &= a + 1.5b - 1.29 \\
v_4 &= a + 2b - 1.85 \\
v_5 &= a + 2.5b - 2.51 \\
v_6 &= a + 3b - 3.02
\end{aligned}
\qquad \text{Residual equations.}
\]

Adding the right-hand members and equating their sum to zero, we get

\[ 6a + 10.5b - 9.80 = 0. \]

Multiplying the right-hand member of the first residual equation by 0.5,
the second by 1, the third by 1.5, etc., adding the products, and equating
their sum to zero, we get

\[ 10.5a + 22.75b - 21.945 = 0. \]

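Hence the normal equations are

\[
\begin{aligned}
6a + 10.5b &= 9.80 \\
10.5a + 22.75b &= 21.945
\end{aligned}
\qquad \text{Normal equations.}
\]

Solving these by determinants, we have

\[
a = \frac{\begin{vmatrix} 9.80 & 10.5 \\ 21.945 & 22.75 \end{vmatrix}}
         {\begin{vmatrix} 6 & 10.5 \\ 10.5 & 22.75 \end{vmatrix}}
  = \frac{222.950 - 230.422}{136.50 - 110.25}
  = -\frac{7.472}{26.25} = -0.285,
\]

\[
b = \frac{\begin{vmatrix} 6 & 9.80 \\ 10.5 & 21.945 \end{vmatrix}}{26.25}
  = \frac{131.670 - 102.900}{26.25}
  = \frac{28.770}{26.25} = 1.096 = 1.10, \text{ say.}
\]

The required equation is therefore

\[ y = -0.285 + 1.10x. \]

Computing the residuals by substituting the given points in this formula,
we have

\[ v_1 = -0.045, \quad v_2 = -0.005, \quad v_3 = 0.075, \]
\[ v_4 = 0.065, \quad v_5 = -0.045, \quad v_6 = -0.005. \]

\[ \therefore\ \sum v = 0.04, \qquad \sum v^2 = 0.014. \]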
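The normal equations of Example 1 are small enough to form and solve directly. The following sketch is an illustration, not part of the original text; the x and y values are those appearing in the residual equations above.

```python
xs = [0.5, 1.0, 1.5, 2.0, 2.5, 3.0]
ys = [0.31, 0.82, 1.29, 1.85, 2.51, 3.02]

n = len(xs)
sx = sum(xs)
sy = sum(ys)
sxx = sum(x * x for x in xs)
sxy = sum(x * y for x, y in zip(xs, ys))

# Normal equations:   n*a  + sx*b  = sy
#                     sx*a + sxx*b = sxy
det = n * sxx - sx * sx
a = (sy * sxx - sx * sxy) / det
b = (n * sxy - sx * sy) / det

# With unrounded a and b the residuals sum to zero, apart from round-off.
residuals = [a + b * x - y for x, y in zip(xs, ys)]
print(f"y = {a:.4f} + {b:.4f} x")
print("sum of squared residuals:", sum(v * v for v in residuals))
```

The sum of the residuals found in the text (0.04) differs from zero only because b was rounded to 1.10 before the residuals were computed.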


Example 2. Find a formula of the form

\[ y = a + bx + cx^2 \]

which will fit the following data:

    x |    0       0.1      0.2      0.3      0.4
    y |  3.1950   3.2299   3.2532   3.2611   3.2516

    x |   0.5      0.6      0.7      0.8      0.9
    y |  3.2282   3.1807   3.1266   3.0594   2.9759


Solution. Substituting in the assumed formula the corresponding values
of x and y as given in the table, we get

\[
\begin{aligned}
v_1 &= a + 0b + 0c - 3.1950 \\
v_2 &= a + 0.1b + 0.01c - 3.2299 \\
v_3 &= a + 0.2b + 0.04c - 3.2532 \\
v_4 &= a + 0.3b + 0.09c - 3.2611 \\
v_5 &= a + 0.4b + 0.16c - 3.2516 \\
v_6 &= a + 0.5b + 0.25c - 3.2282 \\
v_7 &= a + 0.6b + 0.36c - 3.1807 \\
v_8 &= a + 0.7b + 0.49c - 3.1266 \\
v_9 &= a + 0.8b + 0.64c - 3.0594 \\
v_{10} &= a + 0.9b + 0.81c - 2.9759
\end{aligned}
\qquad \text{Residual equations.}
\]


Applying the rule of page 535 to these equations, we get

\[
\begin{aligned}
10a + 4.5b + 2.85c &= 31.7616 \\
4.5a + 2.85b + 2.025c &= 14.0896 \\
2.85a + 2.025b + 1.5333c &= 8.82881
\end{aligned}
\qquad \text{Normal equations.}
\]

Solving these for a, b, c, we find

\[ a = 3.1951, \qquad b = 0.44254, \qquad c = -0.76531. \]

Hence the required equation is

\[ y = 3.1951 + 0.44254x - 0.76531x^2. \]

If we compute the residuals by substituting in this formula the values
of x and y given in the table, we find

\[ \sum v = 0.0001, \qquad \sum v^2 = 0.0000649. \]
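The rule for writing down the normal equations translates directly into code. The sketch below is an illustration under stated assumptions: the helper names `normal_equations` and `solve_linear` are not from the text, and the elimination routine simply stands in for "the ordinary methods of algebra". It rebuilds and solves the normal equations of Example 2.

```python
def normal_equations(rows, ys):
    """Apply the rule: for the k-th normal equation multiply each residual
    equation by the coefficient of the k-th unknown in it and add."""
    m = len(rows[0])
    A = [[sum(r[k] * r[j] for r in rows) for j in range(m)] for k in range(m)]
    rhs = [sum(r[k] * y for r, y in zip(rows, ys)) for k in range(m)]
    return A, rhs

def solve_linear(A, rhs):
    """Solve A z = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    M = [list(row) + [r] for row, r in zip(A, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    z = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(M[r][c] * z[c] for c in range(r + 1, n))
        z[r] = (M[r][n] - s) / M[r][r]
    return z

xs = [i / 10 for i in range(10)]
ys = [3.1950, 3.2299, 3.2532, 3.2611, 3.2516,
      3.2282, 3.1807, 3.1266, 3.0594, 2.9759]
rows = [[1.0, x, x * x] for x in xs]     # coefficients of a, b, c

A, rhs = normal_equations(rows, ys)
a, b, c = solve_linear(A, rhs)
print(f"y = {a:.4f} + {b:.5f} x + {c:.5f} x^2")
```

The same two functions serve for any formula that is linear in its constants; only the list `rows` changes.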

The following example is given to illustrate how the solution of a
problem in a routine, perfunctory manner can lead to a worthless result.
The first computation is the perfunctory one in which the work is done
in a routine, careless manner. The second computation improves on the
first by preventing errors of computation in the evaluation of the
determinants. The third computation prevents errors of computation from
the very beginning.

Example 3. The indicated horse power, I, required to drive a ship of
displacement D tons at a ten-knot speed is given by the following data.
Find a formula of the form $I = aD^n$ which will fit the data.

    D |  1720   2300   3200   4100
    I |   655    789   1000   1164

Solution. We have

\[ I = aD^n. \]
\[ \therefore\ \log I = \log a + n \log D. \]

The residuals are really

\[ v_1 = aD_1^n - I_1, \qquad v_2 = aD_2^n - I_2, \quad \text{etc.,} \]

but we save a great deal of labor and commit very little error by writing

\[
\begin{aligned}
v'_1 &= \log a + n \log D_1 - \log I_1, \\
v'_2 &= \log a + n \log D_2 - \log I_2, \\
&\text{etc.,}
\end{aligned}
\tag{4}
\]

and making the sum of the squares of the v''s a minimum.


(a). Perfunctory Computation. Substituting in these equations the
corresponding values of D and I, we get

\[
\begin{aligned}
v'_1 &= \log a + 3.236n - 2.816 \\
v'_2 &= \log a + 3.362n - 2.897 \\
v'_3 &= \log a + 3.505n - 3.000 \\
v'_4 &= \log a + 3.613n - 3.066
\end{aligned}
\qquad \text{Residual equations.}
\]

Since these equations are linear in the constants n and log a, we can
apply the rule stated on page 535. Adding the right-hand members and
equating their sum to zero, we find the first normal equation to be

\[ 4 \log a + 13.716n = 11.779. \]

Multiplying the right-hand member of the first residual equation by
3.236, the second by 3.362, etc., adding the products, and equating their
sum to zero, we get

\[ 13.716 \log a + 47.11n = 40.445 \]









for the second normal equation. Rounding off these numbers to four
figures, we have

\[
\begin{aligned}
47.11n + 13.72 \log a &= 40.44 \\
13.72n + 4 \log a &= 11.78
\end{aligned}
\qquad \text{Normal equations.}
\]

Solving these equations by determinants, we have

\[
n = \frac{\begin{vmatrix} 40.44 & 13.72 \\ 11.78 & 4 \end{vmatrix}}
         {\begin{vmatrix} 47.11 & 13.72 \\ 13.72 & 4 \end{vmatrix}}
  = \frac{161.76 - 161.62}{188.44 - 188.24}
  = \frac{0.14}{0.20} = 0.700,
\]

\[
\log a = \frac{\begin{vmatrix} 47.11 & 40.44 \\ 13.72 & 11.78 \end{vmatrix}}{0.20}
  = \frac{554.96 - 554.84}{0.20}
  = \frac{0.12}{0.20} = 0.600.
\]

\[ \therefore\ a = 3.981. \]

The resulting formula is therefore

\[ I = 3.98\,D^{0.700}. \]

Computing the residuals by substituting the data in this formula, we get

\[ v_1 = -77, \quad v_2 = -108, \quad v_3 = -131, \quad v_4 = -182. \]

Hence

\[ \sum v^2 = 67{,}878. \]

The formula which we have found is evidently so poor as to be worthless;
for the residuals are large, all of the same sign, and the sum of their
squares is exceedingly large.

The results would have been far worse if we had rounded off to four
figures the products obtained in evaluating the determinants, for in that
case we would have had

\[ n = \frac{161.8 - 161.6}{188.4 - 188.2} = \frac{0.2}{0.2} = 1, \]

\[ \log a = \frac{555.0 - 554.8}{0.2} = \frac{0.2}{0.2} = 1, \qquad \therefore\ a = 10. \]

Hence the formula would have been

\[ I = 10D, \]

which is totally worthless in this case.




The poor result obtained above is due primarily to the fact that in the
process of solving the normal equations three of the most important
significant figures disappeared by subtraction (see Art. 7); for n and log a
were determined from the simple fractions 0.14/0.20 and 0.12/0.20,
respectively, in each of which the second figure in both numerator and
denominator is doubtful. This loss of significant figures did not seriously
affect n, but in the case of a the effect was disastrous. The reason for
the greater effect on a is this: An error $\epsilon$ in log N will cause an error
$2.3026\,N\epsilon$ in the antilog (Art. 7).

(b). Improved Computation. Treating the elements of the determinants
as exact numbers and retaining all the figures in the products,
we have

\[
n = \frac{\begin{vmatrix} 40.44 & 13.72 \\ 11.78 & 4 \end{vmatrix}}
         {\begin{vmatrix} 47.11 & 13.72 \\ 13.72 & 4 \end{vmatrix}}
  = \frac{161.76 - 161.6216}{188.44 - 188.2384}
  = \frac{0.1384}{0.2016} = 0.6865,
\]

\[
\log a = \frac{\begin{vmatrix} 47.11 & 40.44 \\ 13.72 & 11.78 \end{vmatrix}}{0.2016}
  = \frac{554.9558 - 554.8368}{0.2016}
  = \frac{0.1190}{0.2016} = 0.5903,
\]

whence

\[ a = 3.893. \]

The resulting formula is therefore

\[ I = 3.893\,D^{0.6865}. \]

The residuals in this case are

\[ v_1 = 7.16, \quad v_2 = -1.88, \quad v_3 = 7.87, \quad v_4 = -12.1; \]

and

\[ \sum v^2 = 263.1. \]


(c). Accurate Solution.

One way to get the required constants correct to four significant figures
in this example is to solve the problem anew and carry all computations
to eight significant figures, so that we shall have five left after the first
three disappear by subtraction. We therefore make a new computation,
using 7-place logs. The results are as follows:

\[
n = \frac{161.773154 - 161.554946}{188.43257 - 188.10644}
  = \frac{0.21821}{0.32613} = 0.66909,
\]

\[
\log a = \frac{554.89958 - 554.68739}{0.32613}
  = \frac{0.21219}{0.32613} = 0.65063,
\]

and

\[ a = 4.4733 = 4.473, \text{ say.} \]

Hence the final formula is

\[ I = 4.473\,D^{0.6691}. \]

The residuals are found to be

\[ v_1 = -1.1, \quad v_2 = 5.2, \quad v_3 = -9.4, \quad v_4 = 5.3; \]

and therefore

\[ \sum v = 0.0, \qquad \sum v^2 = 144.7. \]
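The effect of premature rounding in this example can be reproduced with a short experiment. The sketch below is an illustration only; the function `fit_power_law` and its two parameters are hypothetical names, not from the text. With three-decimal logarithms and four-figure normal-equation coefficients it mirrors computation (b); with no rounding it corresponds to the accurate solution (c).

```python
import math

disp = [1720, 2300, 3200, 4100]     # displacement D, tons
ihp = [655, 789, 1000, 1164]        # indicated horse power I

def round_sig(value, sig):
    """Round value to `sig` significant figures."""
    return round(value, sig - 1 - math.floor(math.log10(abs(value))))

def fit_power_law(log_places=None, coeff_sig=None):
    """Least-squares fit of log I = log a + n log D.
    log_places: decimals kept in the logarithms (None keeps full precision).
    coeff_sig:  significant figures kept in the normal-equation sums."""
    xs = [math.log10(d) for d in disp]
    ys = [math.log10(i) for i in ihp]
    if log_places is not None:
        xs = [round(x, log_places) for x in xs]
        ys = [round(y, log_places) for y in ys]
    sx, sy = sum(xs), sum(ys)
    sxx = sum(x * x for x in xs)
    sxy = sum(x * y for x, y in zip(xs, ys))
    if coeff_sig is not None:
        sx, sy, sxx, sxy = (round_sig(s, coeff_sig) for s in (sx, sy, sxx, sxy))
    m = len(xs)
    # Normal equations: sxx*n + sx*log a = sxy,  sx*n + m*log a = sy
    det = sxx * m - sx * sx
    n = (sxy * m - sy * sx) / det
    log_a = (sxx * sy - sx * sxy) / det
    return 10 ** log_a, n

a_rough, n_rough = fit_power_law(log_places=3, coeff_sig=4)
a_full, n_full = fit_power_law()
print(f"rounded inputs: I = {a_rough:.3f} D^{n_rough:.4f}")
print(f"full precision: I = {a_full:.3f} D^{n_full:.4f}")
```

The determinant in the denominator is a small difference of two nearly equal products, which is why the four-figure coefficients change the constants so markedly.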

Note. This example serves to bring out an important point which must 
be kept in mind when determining the constants in empirical formulas. 
The point is this: The data used in determining the constants should be 
treated as exact numbers, and the computer must be careful about rounding 
off and dropping seemingly superfluous digits at any stage of the com- 
putation. The final values of the constants should be given to as many 
significant figures as are given in the original data. 

When it happens that some of the most important significant figures
disappear by subtraction, as in the example above, the computation must
be carried through with enough significant figures at all stages to give a
reliable result. As a general rule it may be stated that if the constants
are desired to m significant figures and if a preliminary calculation shows
that the first p figures will disappear by subtraction, the calculation must be
performed with m + p + 1 significant figures throughout from beginning
to end.

In the solution of systems of linear equations the occasional loss of the 
leading significant figures by subtraction cannot be prevented, but the 
harmful effect of such loss can be lessened by preventing subsequent errors 
of computation. 

Here it may be remarked that the above rather trivial example has also 
been worked by two other methods: (1) by moving the origin to a point 
near the middle of the interval and (2) by the general method of Art. 169. 
All three methods gave the same result to four significant figures. 

168. Weighted Residuals. It sometimes happens that the residuals are
not all of the same weight. This is the case when we use the residuals
of a function of y instead of those of y itself. In Ex. 2, Art. 165, and
Ex. 3, Art. 167, for example, we found it necessary to use the residuals
of log y instead of those of y. In these cases the residuals were no longer
of equal weight, as we shall now show.

Using the notation of Art. 161, let the function of y under consideration
be f(y). Then its derivative with respect to y is f'(y).

Substituting this in (161.1), we get

\[ R = f'(y)\,r, \]

where r denotes the P.E. of y and R the P.E. of f(y). Hence

\[ \frac{R}{r} = f'(y). \]

Since the same relations hold between residuals as between probable
errors, we may write

\[ \frac{R}{r} = \frac{V}{v}, \]

where v and V denote the residuals of y and f(y), respectively. Hence

\[ \frac{V}{v} = f'(y). \]


Denoting by $w_y$ and $w_f$ the weights of y and f(y), respectively, we have
from (15ri. *3) 


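\[ \frac{w_f}{w_y} = \frac{r^2}{R^2} = \frac{v^2}{V^2} = \frac{1}{[f'(y)]^2}, \]

or

\[ w_f = \frac{w_y}{[f'(y)]^2}. \tag{168.1} \]

Now if $f(y) = \log_{10} y = M \log_e y$, where M = 0.43429, we have

\[ f'(y) = \frac{M}{y}. \]

Hence from (168.1)

\[ w_f = \frac{y^2}{M^2}\,w_y; \]

and if all the y's are of equal weight, then $w_y = 1$ and we have

\[ w_f = \frac{y^2}{M^2}. \tag{168.3} \]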

We shall next derive the fundamental rule for writing down the normal 
equations when the residuals have different weights. 

By Art. 148 the best result obtainable from measurements of unequal 
weight is that for which the sum of the weighted squares of the residuals 
is a minimum. Hence we must have 

\[ \sum wv^2 = w_1v_1^2 + w_2v_2^2 + \cdots + w_nv_n^2 = \text{a minimum.} \]

In the case of the equation $y = a + bx + cx^2$ (Art. 145) we therefore have

\[ w_1(a + bx_1 + cx_1^2 - y_1)^2 + w_2(a + bx_2 + cx_2^2 - y_2)^2 + \cdots = \text{a minimum.} \]

Calling this expression f(a, b, c), taking the partial derivatives with
respect to a, b, c in turn, and equating each to zero, we have

\[
\begin{aligned}
\frac{\partial f}{\partial a} &= 2w_1(a + bx_1 + cx_1^2 - y_1) + 2w_2(a + bx_2 + cx_2^2 - y_2) + \cdots = 0, \\
\frac{\partial f}{\partial b} &= 2w_1x_1(a + bx_1 + cx_1^2 - y_1) + 2w_2x_2(a + bx_2 + cx_2^2 - y_2) + \cdots = 0, \\
\frac{\partial f}{\partial c} &= 2w_1x_1^2(a + bx_1 + cx_1^2 - y_1) + 2w_2x_2^2(a + bx_2 + cx_2^2 - y_2) + \cdots = 0.
\end{aligned}
\]
Hence on dividing through by 2 we get 

\[
\begin{aligned}
&w_1(a + bx_1 + cx_1^2 - y_1) + w_2(a + bx_2 + cx_2^2 - y_2) + \cdots + w_n(a + bx_n + cx_n^2 - y_n) = 0, \\
&w_1x_1(a + bx_1 + cx_1^2 - y_1) + w_2x_2(a + bx_2 + cx_2^2 - y_2) + \cdots + w_nx_n(a + bx_n + cx_n^2 - y_n) = 0, \\
&w_1x_1^2(a + bx_1 + cx_1^2 - y_1) + w_2x_2^2(a + bx_2 + cx_2^2 - y_2) + \cdots + w_nx_n^2(a + bx_n + cx_n^2 - y_n) = 0.
\end{aligned}
\qquad \text{Weighted normal equations.}
\]

In the case of weighted residuals we can therefore write down the
normal equations according to the following

Rule: To get the first normal equation multiply the right-hand side
of each residual equation by its weight and by the coefficient of the first
unknown in that equation, add the products thus obtained, and equate
their sum to zero; to find the second normal equation multiply the
right-hand member of each residual equation by its weight and by the
coefficient of the second unknown in that member, add the products, and
equate their sum to zero; and so on for the others.

We shall now work Ex. 3 of the preceding article by the method of
weights. By (168.3) the weights of the residuals are $I_1^2/M^2$, $I_2^2/M^2$,
etc.; but since the factor $1/M^2$ will divide out in the normal
equations we do not write it down at all. The solution given below should
be self-explanatory.


\[ I = aD^n. \qquad \therefore\ \log I = \log a + n \log D. \]

    D   |   1720     2300      3200      4100
    I   |    655      789      1000      1164
    I^2 | 429025   622521   1000000   1354896

\[
\begin{array}{ll}
 & \text{Weights} \\
v_1 = \log a + 3.2355284n - 2.8162413 & 429025 \\
v_2 = \log a + 3.3617278n - 2.8970770 & 622521 \\
v_3 = \log a + 3.5051500n - 3.0000000 & 1000000 \\
v_4 = \log a + 3.6127839n - 3.0659530 & 1354896
\end{array}
\qquad \text{Residual equations.}
\]

Now applying the rule for writing down the weighted normal equations, 
we find them to be 


\[
\begin{aligned}
11880965.2\,n + 3406442 \log a &= 10165776.6 \\
41497013.1\,n + 11880965.2 \log a &= 35495260.6
\end{aligned}
\qquad \text{Weighted normal equations.}
\]

Solving these by determinants, we find

\[ n = 0.6671, \qquad a = 4.546. \]

The required formula is therefore

\[ I = 4.546\,D^{0.6671}. \]

The residuals are found to be

\[ v_1 = -0.3, \quad v_2 = 5.8, \quad v_3 = -9.4, \quad v_4 = 4.8. \]

\[ \therefore\ \sum v = 0.9, \qquad \sum v^2 = 145.1. \]
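The weighted normal equations above can be formed and solved mechanically. The following sketch is an illustration, not the author's computing schedule; it uses the weights $I^2$ (the common factor $1/M^2$ being dropped, as in the text) and full-precision logarithms in place of 7-place tables.

```python
import math

disp = [1720, 2300, 3200, 4100]     # displacement D, tons
ihp = [655, 789, 1000, 1164]        # indicated horse power I

xs = [math.log10(d) for d in disp]
ys = [math.log10(i) for i in ihp]
ws = [i * i for i in ihp]           # weights y^2; the factor 1/M^2 cancels

sw = sum(ws)
swx = sum(w * x for w, x in zip(ws, xs))
swy = sum(w * y for w, y in zip(ws, ys))
swxx = sum(w * x * x for w, x in zip(ws, xs))
swxy = sum(w * x * y for w, x, y in zip(ws, xs, ys))

# Weighted normal equations:
#   swx*n  + sw*log a  = swy
#   swxx*n + swx*log a = swxy
det = swxx * sw - swx * swx
n = (swxy * sw - swy * swx) / det
log_a = (swxx * swy - swx * swxy) / det
a = 10 ** log_a

# Residuals of I itself, taken as computed value minus observed value.
residuals = [a * d ** n - i for d, i in zip(disp, ihp)]
print(f"I = {a:.3f} D^{n:.4f}")
print("residuals:", [round(v, 1) for v in residuals])
```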

Here the values of $\sum v$ and $\sum v^2$ are slightly larger than in the
previous unweighted solution, but the lack of improvement is not the fault
of the weighting method. It is due to the singular nature of the example treated.
After applying this weighting method to several simple examples of different
types and comparing the results with those obtained by ignoring differences
in weight, the author is of the opinion that ordinarily it is not worth
while to bother about the weights of the residuals; but problems sometimes
arise in which the weights must be considered.*


* For a striking example of the effect of weighting in some problems see an
important paper by C. E. Van Orstrand: "On the Empirical Representation of Certain

















Remark. Since the weights in the preceding example are approximately 
as the numbers 43, 62, 100, and 135, the student may wonder why it is 
not sufficient to multiply the residuals by these smaller numbers instead 
of by the actual weights 429025, 622521, etc. The answer is that if we 
did this the corresponding products would be true to only two or three 
significant figures and these would disappear in this problem by subtraction 
in solving the normal equations, so that the results found would be very 
uncertain. We can state as a general rule that the number of significant 
figures used in the weights must not be less than the number of significant 
figures which are to be retained throughout the computation, unless the 
exact values of the weights happen to contain fewer figures than the 
number retained throughout the computation. 

169. Non-Linear Formulas. The General Case. Not all empirical
formulas can be handled by the methods thus far considered. For example,
the relation between the pressure p and temperature t of saturated steam
can be expressed by a formula of the type

\[ p = a(10)^{bt/(c+t)}, \]

where a, b, c are unknown constants. These constants do not enter the
formula linearly, and no transformation of the formula will give a linear
relation among them. Consequently they can not be determined by the
methods previously given. We are now going to develop a method which
will apply to any type of formula, however complicated it may be.

Let us consider a formula involving two variables, x and y, and three
undetermined constants, a, b, c. Such a formula may be written in the
symbolic form

\[ y = f(x, a, b, c). \tag{1} \]

Let $a_0$, $b_0$, $c_0$ be approximate values of a, b, c, obtained from a graph
or by any other means, and let $\alpha$, $\beta$, $\gamma$ denote corrections which are to be
applied to $a_0$, $b_0$, $c_0$, respectively, so that

\[ a = a_0 + \alpha, \qquad b = b_0 + \beta, \qquad c = c_0 + \gamma. \tag{2} \]

Then

\[ y' = f(x, a_0, b_0, c_0) \tag{3} \]

will be a function whose graph approximates the graph of (1) more or
less closely. The values of this approximating function corresponding to
$x_1, x_2, \ldots, x_n$ will be

\[
\begin{aligned}
y'_1 &= f(x_1, a_0, b_0, c_0), \\
y'_2 &= f(x_2, a_0, b_0, c_0), \\
&\;\;\vdots \\
y'_n &= f(x_n, a_0, b_0, c_0).
\end{aligned}
\tag{4}
\]


If we take (1) to be the best or most probable function and its graph
to be the best representative curve, then the residuals will be

\[
\begin{aligned}
v_1 &= f(x_1, a, b, c) - y_1, \\
v_2 &= f(x_2, a, b, c) - y_2, \\
&\;\;\vdots \\
v_n &= f(x_n, a, b, c) - y_n,
\end{aligned}
\tag{5}
\]

where $y_1, y_2, \ldots, y_n$ are the observed y's corresponding to $x_1, x_2, \ldots, x_n$,
respectively. Substituting in (5) the values of a, b, c as given by (2),
we have for the first residual

\[ v_1 = f(x_1, a_0 + \alpha, b_0 + \beta, c_0 + \gamma) - y_1, \]

or

\[ v_1 + y_1 = f(x_1, a_0 + \alpha, b_0 + \beta, c_0 + \gamma). \tag{6} \]

Considering the right-hand member of (6) as a function of a, b, c and 
expanding it by Taylor’s theorem for a function of several variables, we 
have 

(7) -f y, = /(x„ Oo, 6o, Co) 

f(a. »(»).+>©) 

-f- terms involving higher powers and products of a, /3, y, 
where (0/,/9o)o means 

a = Oo 
6 = bo 
b Co 

Then since /i = /(xt, Oo, bo, Co), (7) becomes 

• (^),+ /» (^),+ r (^) + • • • , 



Akt. 160J 


NON-LINEAR FORMULAS 


647 


*'* - * (^),+ ^ (^),+ ^ (^)o+ • 

Let 

^x = y'i—yi, Ti — y\ — y2, • • • r, = /,— y, . 

Then the residuals become 


( 109 . 1 ) 


"■ - “ ^ (^).+ ’’ (^). 


Residual 

equations. 


These equations are linear (of the first degree) in the corrections $\alpha$, $\beta$, $\gamma$,
and we may therefore deal with the problem from this point onward
either by the method of averages or by the method of least squares. If we
use the latter method, we write down the normal equations by the rule
stated on page 535.

The quantities $r_1, r_2, \ldots, r_n$ are the residuals for the approximation
curve $y' = f(x, a_0, b_0, c_0)$, since they are the differences between the
observed ordinates and the ordinates to this curve.
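
The procedure just described can be sketched in code. What follows is a minimal Python sketch, assuming NumPy is available; the function name `correction_step`, the trial formula $y = ae^{bx}$, and the data are illustrative assumptions and are not taken from the text. It forms the r's and the partial derivatives at the approximate constants, solves the linear residual equations for the corrections by least squares, and returns the corrected constants. Repeating the correction several times, as done at the end, is an addition of this sketch.

```python
import numpy as np


def correction_step(f, grads, x, y, params0):
    """One pass of the correction method: linearize f about params0 and
    solve the residual equations for the corrections by least squares."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    p0 = np.asarray(params0, dtype=float)
    # r_i = y'_i - y_i: ordinate of the approximation curve minus observed ordinate.
    r = f(x, *p0) - y
    # Column j holds the partial derivative with respect to constant j at each x_i.
    J = np.column_stack([np.broadcast_to(g(x, *p0), x.shape) for g in grads])
    # Residual equations: v = J @ delta + r. Least squares minimizes sum(v**2).
    delta, *_ = np.linalg.lstsq(J, -r, rcond=None)
    return p0 + delta


# Hypothetical illustration: y = a * exp(b * x) with made-up, error-free data.
def f(x, a, b):
    return a * np.exp(b * x)


grads = [
    lambda x, a, b: np.exp(b * x),          # df/da
    lambda x, a, b: a * x * np.exp(b * x),  # df/db
]

x = np.linspace(0.0, 2.0, 9)
y = f(x, 2.0, 0.5)

params = np.array([1.8, 0.55])  # rough "graphical" estimates a0, b0
for _ in range(5):              # repeat the correction a few times
    params = correction_step(f, grads, x, y, params)
print(params)
```

Using `numpy.linalg.lstsq` here is equivalent to forming and solving the normal equations; the method of averages would instead sum the residual equations in groups.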

We shall now apply this general method to two examples. 

Example 1. Find a formula of the form

$y = mx + b$

which will fit the following data:

x | 27 | 33 | 40 | 55 | 68
y | 109.9 | 112.0 | 114.7 | 120.1 | 125.0

Solution. When these values are plotted on ordinary coordinate paper,
the points are found to lie nearly on a straight line (Fig. 51). The line
which seems (to the eye) to fit them best has a slope of 0.37 and a
y-intercept of 99.7. Hence we take

$m_0 = 0.37, \quad b_0 = 99.7.$

The approximation curve is therefore the line

$y' = 0.37x + 99.7.$







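Substituting in this equation the observed values of x, we get

$y'_1 = 0.37 \times 27 + 99.7 = 109.7,$
$y'_2 = 0.37 \times 33 + 99.7 = 111.9,$
$y'_3 = 114.7, \quad y'_4 = 120.0, \quad y'_5 = 124.9.$

Hence

$r_1 = 109.7 - 109.9 = -0.2,$
$r_2 = 111.9 - 112.0 = -0.1,$
$r_3 = 0.0, \quad r_4 = -0.1, \quad r_5 = -0.1.$

[Fig. 51]

Also, since $f = mx + b$, we have $\partial f/\partial m = x$ and $\partial f/\partial b = 1$, so that

$\dfrac{\partial f_1}{\partial m} = 27, \quad \dfrac{\partial f_2}{\partial m} = 33, \quad \dfrac{\partial f_3}{\partial m} = 40, \quad \dfrac{\partial f_4}{\partial m} = 55, \quad \dfrac{\partial f_5}{\partial m} = 68,$

and

$\dfrac{\partial f_1}{\partial b} = \dfrac{\partial f_2}{\partial b} = \dfrac{\partial f_3}{\partial b} = \dfrac{\partial f_4}{\partial b} = \dfrac{\partial f_5}{\partial b} = 1.$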


Substituting in (169.1) these values of the r's and partial derivatives,
we get

$v_1 = 27\alpha + \beta - 0.2$
$v_2 = 33\alpha + \beta - 0.1$
$v_3 = 40\alpha + \beta + 0.0$
$v_4 = 55\alpha + \beta - 0.1$
$v_5 = 68\alpha + \beta - 0.1$
(Residual equations.)

We shall complete the problem by finding the best values of $\alpha$ and $\beta$ by
the method of least squares. Forming the normal equations according
to the rule on page 535, we get

$11068\alpha + 223\beta = 21.0$
$223\alpha + 5\beta = 0.5$
(Normal equations.)

Solving these for $\alpha$ and $\beta$, we find

$\alpha = -0.0012, \quad \beta = 0.152.$

Hence

$m = 0.37 - 0.0012 = 0.3688,$
$b = 99.7 + 0.15 = 99.85.$

The required formula is therefore

$y = 0.3688x + 99.85.$


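Example 2. Find more accurate values for the constants $a$, $b$, $c$ in
the formula

$p = a(10)^{bt/(c+t)},$

given the approximate values

$a_0 = 4.53, \quad b_0 = 7.45, \quad c_0 = 234.7.$

Solution. For the partial derivatives $(\partial p/\partial a)_0$, $(\partial p/\partial b)_0$, $(\partial p/\partial c)_0$ we
have

$\left(\dfrac{\partial p}{\partial a}\right)_0 = (10)^{b_0 t/(c_0+t)},$

$\left(\dfrac{\partial p}{\partial b}\right)_0 = \dfrac{a_0 t}{c_0 + t}\,(10)^{b_0 t/(c_0+t)} \log_e 10,$

$\left(\dfrac{\partial p}{\partial c}\right)_0 = -\dfrac{a_0 b_0 t}{(c_0 + t)^2}\,(10)^{b_0 t/(c_0+t)} \log_e 10.$

Also

$p'_1 = a_0(10)^{b_0 t_1/(c_0+t_1)}, \quad p'_2 = a_0(10)^{b_0 t_2/(c_0+t_2)},$ etc.;

and

$r_1 = p'_1 - p_1, \quad r_2 = p'_2 - p_2,$ etc.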



In the following table are given the observed values of $t$ and $p$, the
corresponding values of the partial derivatives, and the corresponding r's.
Entries that cannot be read in the source are left blank.

No. | t | p | $(\partial p/\partial a)_0$ | $(\partial p/\partial b)_0$ | $(\partial p/\partial c)_0$ | r | Group
1 | -5.31 | 2.95 | 0.672 | -0.162 | +0.005 | +0.095 | I
2 | -3.64 | 3.45 | 0.763 | -0.125 | +0.004 | +0.007 | I
3 | 0.00 | 4.52 | | 0.000 | 0.000 | +0.010 | I
4 | 8.01 | 7.93 | | | -0.019 | +0.049 | I
5 | 11.98 | 9.88 | | 1.165 | -0.035 | +0.541 | I
6 | 16.82 | 13.52 | 3.149 | 2.197 | -0.065 | +0.746 | I
7 | 23.85 | 22.24 | 4.867 | 4.683 | -0.135 | -0.194 | I
8 | 35.95 | 43.96 | 9.763 | 13.526 | -0.372 | +0.265 | II
9 | 44.90 | | 15.717 | 26.326 | -0.701 | -0.002 | II
10 | 52.12 | | 22.583 | 42.805 | -1.112 | +0.903 | II
11 | | | | 64.487 | -1.638 | +0.303 | II
12 | | 281.55 | | 156.525 | -3.772 | +0.668 | II
13 | | | 74.668 | 195.821 | -4.653 | +7.665 | II
14 | | 387.56 | 85.765 | 232.150 | -5.457 | +0.955 | II
15 | 86.21 | 453.31 | 100.319 | 281.108 | -6.526 | +1.137 | III
16 | 91.34 | 552.20 | 122.212 | 357.124 | -8.160 | +1.422 | III
17 | 93.66 | 602.53 | 133.354 | 396.756 | -9.002 | +1.564 | III
18 | 99.39 | 743.49 | 164.564 | 510.656 | -11.387 | +1.986 | III
19 | 100.87 | 784.07 | 173.547 | 544.139 | -12.080 | +2.099 | III
20 | 104.64 | 895.83 | 198.293 | 637.798 | -14.002 | +2.435 | III



Denoting the corrections to $a$, $b$, $c$ by $\alpha$, $\beta$, $\gamma$, respectively, and
substituting in (169.1) the values of the r's and partial derivatives given
in the table, we get 20 residual equations for determining $\alpha$, $\beta$, $\gamma$. In this
problem we are going to use the method of averages; so it is not necessary
to write down the residual equations. We simply divide the coefficients
into three groups, as indicated in the table, and add the coefficients in each
group.

We thus get the following three equations:

$14.512\alpha + 8.364\beta - 0.245\gamma = -1.254$
$301.706\alpha + 731.640\beta - 17.705\gamma = -10.757$
$892.289\alpha + 2727.581\beta - 61.157\gamma = -10.643$

Solving these equations for $\alpha$, $\beta$, $\gamma$, we get

$\alpha = -0.09669, \quad \beta = 0.06001, \quad \gamma = 1.43991,$















so that the corrected values of the constants are

$a = 4.53 - 0.09669 = 4.433,$
$b = 7.45 + 0.06001 = 7.510,$
$c = 234.7 + 1.43991 = 236.14.$

The final equation is therefore

$p = 4.433(10)^{7.510t/(236.14+t)}.$
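
As a check on the arithmetic of Example 2, the three averaged equations can be solved numerically. This is a minimal Python sketch, assuming NumPy is available and taking the approximate constants to be $a_0 = 4.53$, $b_0 = 7.45$, $c_0 = 234.7$ (the values implied by the corrected constants); the names `A`, `rhs`, and `p_fit` are illustrative only.

```python
import numpy as np

# Group sums from the three averaged equations:
# coefficients of alpha, beta, gamma, and the right-hand sides.
A = np.array([
    [14.512, 8.364, -0.245],
    [301.706, 731.640, -17.705],
    [892.289, 2727.581, -61.157],
])
rhs = np.array([-1.254, -10.757, -10.643])

alpha, beta, gamma = np.linalg.solve(A, rhs)

# Approximate constants (assumed values, see the lead-in).
a0, b0, c0 = 4.53, 7.45, 234.7
a, b, c = a0 + alpha, b0 + beta, c0 + gamma


def p_fit(t):
    """Corrected formula p = a * 10**(b*t / (c + t))."""
    return a * 10.0 ** (b * t / (c + t))


print(alpha, beta, gamma)
print(a, b, c)
```

The solved corrections should agree with those quoted in the text to about four decimal places; small differences are to be expected because the group sums are themselves rounded.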


170. Determination of the Constants when Both Variables are Sub- 
ject to Error. In Arts. 166-169 it was tacitly assumed that the given 
values of the independent variable were absolutely correct and free from 
all error; the values of the function alone were supposed to be subject 
to error. This assumption is legitimate in most cases, for it is usually 
possible and practicable to obtain the values of one variable more 
accurately than the other. 

If both variables are subject to errors of the same order of magnitude, 
the problem of finding the best values of the empirical constants is more 
complicated except in those cases in which the data can be plotted as a 
straight-line graph, either directly or after a suitable change of one or 
both variables. In the present article we shall treat only the simple case 
in which both variables are of equal weight. This is sufficient for most
problems; for, as was seen in Art. 168, it is not often necessary to take
account of differences in weight.

Let us consider $n$ pairs of values $(x_1, y_1), (x_2, y_2), \ldots, (x_n, y_n)$, and
let these be plotted as points on a straight-line graph. The line which
best fits these points will evidently be that for which the sum of the
squares of the perpendicular distances from the points to it is a minimum.
The equation of any straight line may be written in the form

(1)  $ax + by + 1 = 0,$

this symmetrical form being used because both x and y are equally subject
to error. The perpendicular distance from any point $(x', y')$ to the line
(1) is given by the formula

(2)  $d = \dfrac{ax' + by' + 1}{\sqrt{a^2 + b^2}}.$

The sum of the squares of the perpendicular distances from the points
$(x_1, y_1)$, etc. to the line (1) is therefore

(3)  $F(a, b) = \dfrac{1}{a^2 + b^2}\left[(ax_1 + by_1 + 1)^2 + (ax_2 + by_2 + 1)^2 + \cdots + (ax_n + by_n + 1)^2\right].$


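Since this is to be a minimum, its partial derivatives with respect to $a$
and $b$ must each be zero.

Taking the partial derivative of (3) with respect to $a$, we have

$\dfrac{\partial F}{\partial a} = -\dfrac{2a}{(a^2+b^2)^2}\left[(ax_1 + by_1 + 1)^2 + \cdots + (ax_n + by_n + 1)^2\right]$
$\qquad + \dfrac{2}{a^2+b^2}\left[x_1(ax_1 + by_1 + 1) + x_2(ax_2 + by_2 + 1) + \cdots + x_n(ax_n + by_n + 1)\right].$

Expanding the terms within the brackets, reducing to a common denominator,
and collecting terms, we get

(4)  $\dfrac{\partial F}{\partial a} = \dfrac{2}{(a^2+b^2)^2}\left[b(b^2 - a^2)\sum xy + (b^2 - a^2)\sum x + ab^2\left(\sum x^2 - \sum y^2\right) - 2ab\sum y - an\right].$

Likewise, by symmetry,

(5)  $\dfrac{\partial F}{\partial b} = \dfrac{2}{(a^2+b^2)^2}\left[a(a^2 - b^2)\sum xy + (a^2 - b^2)\sum y + a^2 b\left(\sum y^2 - \sum x^2\right) - 2ab\sum x - bn\right].$

Multiplying (4) by $a$, (5) by $b$, adding the results, and simplifying,
we get

(6)  $a\dfrac{\partial F}{\partial a} + b\dfrac{\partial F}{\partial b} = -\dfrac{2}{a^2+b^2}\left(a\sum x + b\sum y + n\right).$

But since $\partial F/\partial a = 0$ and $\partial F/\partial b = 0$ for a minimum, (6) reduces to

$a\sum x + b\sum y + n = 0,$

or

(170.1)  $a\left(\dfrac{\sum x}{n}\right) + b\left(\dfrac{\sum y}{n}\right) + 1 = 0,$

which shows that equation (1) is satisfied by the values

$x = \dfrac{\sum x}{n}, \quad y = \dfrac{\sum y}{n}.$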

In other words, the best representative line always passes through the 
centroid of the given points. 

Since $\partial F/\partial a$ and $\partial F/\partial b$ must be zero for a minimum, we have from
(4) and (5), respectively,

(170.2)  $b(b^2 - a^2)\sum xy + (b^2 - a^2)\sum x - 2ab\sum y + ab^2\left(\sum x^2 - \sum y^2\right) - an = 0,$

(170.3)  $a(a^2 - b^2)\sum xy + (a^2 - b^2)\sum y - 2ab\sum x - a^2 b\left(\sum x^2 - \sum y^2\right) - bn = 0.$

Problems of the type treated in this article are to be solved by means
of formulas (170.1) and (170.2) or (170.1) and (170.3), always using
(170.1) first. We shall apply this method to Example 1 of Art. 167.

Example.

x | y | xy | $x^2$ | $y^2$
0.5 | 0.31 | 0.155 | 0.25 | 0.0961
1.0 | 0.82 | 0.820 | 1.00 | 0.6724
1.5 | 1.29 | 1.935 | 2.25 | 1.6641
2.0 | 1.85 | 3.700 | 4.00 | 3.4225
2.5 | 2.51 | 6.275 | 6.25 | 6.3001
3.0 | 3.02 | 9.060 | 9.00 | 9.1204
Sums: 10.5 | 9.80 | 21.945 | 22.75 | 21.2756

To facilitate the computation, the several known quantities are arranged
in tabular form as shown above.

Since

$\dfrac{\sum x}{n} = \dfrac{10.5}{6} = 1.750, \quad \dfrac{\sum y}{n} = \dfrac{9.80}{6} = \dfrac{4.90}{3},$

we have by (170.1)

$1.750a + \dfrac{4.90}{3}b + 1 = 0,$

or

$b = -\dfrac{5.25a + 3}{4.9}.$

Substituting this value of $b$ in (170.3) and reducing, we get

$5.7187a^3 + 23.4548a^2 + 6.1650a = 0.$

Solving for $a$, we find

$a = 0, \quad -3.8191, \quad -0.28227.$

The corresponding values of $b$ are found from the equation
$b = -(5.25a + 3)/4.9$ to be

$b = -0.61224, \quad 3.4796, \quad -0.30981.$













Since the slope of the line (1) is $-a/b$, it is obvious that the values
$a = -3.8191$, $b = 3.4796$ are the only ones which will fit the data of
this example. The required line is therefore

$-3.8191x + 3.4796y + 1 = 0,$

or

$3.819x - 3.480y = 1,$

or

$y = -0.2874 + 1.097x.$

This last equation agrees closely with that found by the ordinary method
in Art. 145.

If we compute the sum of the squares of the perpendicular distances
from the several points to this line, we find

$\sum d^2 = 0.00618.$

For the line found in Ex. 1, Art. 167, we find

$\sum d^2 = 0.00619;$

the two results are thus practically identical.

Remark. The reader will observe that the determination of the best 
representative line by the method of the present article involves but little, 
if any, more labor than the ordinary method of Art. 167. 
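
The result of this example can be checked numerically. The sketch below is a minimal Python illustration, not the procedure of the text: instead of solving the cubic in $a$, it uses the equivalent closed form for the line through the centroid that minimizes the sum of squared perpendicular distances (the slope is a root of a quadratic in the centered sums). The function name `perpendicular_fit` is an assumption made for illustration, and the formula assumes the centered sum of products is not zero.

```python
import math


def perpendicular_fit(xs, ys):
    """Line y = intercept + slope*x minimizing the sum of squared
    perpendicular distances; also returns that minimum sum."""
    n = len(xs)
    xbar = sum(xs) / n
    ybar = sum(ys) / n
    sxx = sum((x - xbar) ** 2 for x in xs)
    syy = sum((y - ybar) ** 2 for y in ys)
    sxy = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys))
    root = math.hypot(syy - sxx, 2.0 * sxy)
    # The '+' root has the same sign as sxy and gives the minimum.
    slope = (syy - sxx + root) / (2.0 * sxy)
    intercept = ybar - slope * xbar        # the line passes through the centroid
    sum_d2 = 0.5 * (sxx + syy - root)      # minimum of F(a, b)
    return slope, intercept, sum_d2


xs = [0.5, 1.0, 1.5, 2.0, 2.5, 3.0]
ys = [0.31, 0.82, 1.29, 1.85, 2.51, 3.02]
slope, intercept, sum_d2 = perpendicular_fit(xs, ys)

# Convert to the symmetric form a*x + b*y + 1 = 0 used in this article.
a = slope / intercept
b = -1.0 / intercept
print(slope, intercept, sum_d2, a, b)
```

The symmetric form $ax + by + 1 = 0$ cannot represent a line through the origin, which is why the conversion at the end divides by the intercept.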

171. Finding the Best Type of Formula. There exists no general 
method for finding the best type of formula to fit any given set of data. 
Probably the best one can do is to proceed as follows: 

1. Plot the data on rectangular coordinate paper, taking care to choose 
the proper scales along the two axes so as to make the graph show up to 
the best advantage. 

2. If the graph is a straight line, or nearly so, assume a formula of
the type

$y = a + bx.$

3. If the graph is not a straight line but is a fairly smooth curve
without sharp turns or bends, it is likely that the data can be fitted by
some one of the following formulas (remarks and suggestions follow each):

(a)  $y = a + bx + cx^2 + dx^3$.  Linear in the constants.
(b)  $y = a + \dfrac{b}{x}$.  Linear in constants. Put $1/x = t$ to plot.
(c)  $y = \dfrac{1}{a + bx}$, or $\dfrac{1}{y} = a + bx$.  Put $1/y = u$ and plot the straight line $u = a + bx$.
(d)  $y^2 = a + bx + cx^2 + dx^3$.  Linear in constants.
(e)  $y = ab^x$, or $\log y = \log a + x \log b$.
(f)  $y = ae^{bx}$, or $\log y = \log a + bx \log e$.
(g)  $\log y = a + bx + cx^2$.  Linear in constants.
(h)  $y = \dfrac{x}{a + bx + cx^2}$, or $\dfrac{x}{y} = a + bx + cx^2$.  Linear in constants.
(i)  $y = ax^n$, or $\log y = \log a + n \log x$.
(j)  $y = ax^n + b$.  Use general method of Art. 169.
(k)  $y = ae^{bx} + c$.  Use general method of Art. 169.
(l)  (formula illegible in source).  Use general method of Art. 169.
(m)  $y = ae^{bx} + ce^{dx}$.  Use general method of Art. 169.
(n)  $y = ax^n + bx^m$.  Use general method of Art. 169.

4. As aids in determining which of the formulas (a)-(n) to use in any
given problem, the following suggestions are offered:

(a) If the observed data give a straight-line graph when plotted on
logarithmic paper, use the formula

$y = ax^n.$

(b) If the data give a straight line when plotted on semilogarithmic
paper, the proper formula is

$y = ae^{bx}$ or $y = ab^x.$

(c) If the points $(1/x, y)$ or $(x, 1/y)$ lie on a straight line when plotted
on ordinary coordinate paper, the proper formula is $y = a + b/x$ in the
first case and $y = 1/(a + bx)$ or $1/y = a + bx$ in the second case.

5. The polynomial formula

$y = a + bx + cx^2 + dx^3 + \cdots + gx^n$

can be used to fit any set of data by taking a sufficient number of terms.
The requisite number of terms is given by the following


Theorem: If the values of x are in arithmetic progression (equidistant)
and the nth differences of the y's are constant, the last term in the required
polynomial is $x^n$.

This theorem is simply a corollary of the theorem proved in Art. 19.

For example, the third differences in the following data are nearly
constant; so the required polynomial is

$y = a + bx + cx^2 + dx^3.$


x | y | $\Delta y$ | $\Delta^2 y$ | $\Delta^3 y$
0 | 0 | | |
0.1 | 0.212 | 0.212 | |
0.2 | 0.463 | 0.251 | 0.039 |
0.3 | 0.772 | 0.309 | 0.058 | 0.019
0.4 | 1.153 | 0.381 | 0.072 | 0.014
0.5 | 1.625 | 0.472 | 0.091 | 0.019
0.6 | 2.207 | 0.582 | 0.110 | 0.019
0.7 | 2.917 | 0.710 | 0.128 | 0.018
0.8 | 3.776 | 0.859 | 0.149 | 0.021
0.9 | 4.798 | 1.022 | 0.163 | 0.014
1.0 | 6.001 | 1.203 | 0.181 | 0.018


This theorem applies only when the x's are taken at equal intervals
apart. It rarely pays to take more than three or four terms in a
polynomial formula, on account of the labor involved in determining the
constants.
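
The difference test of the theorem is easy to mechanize. The following minimal Python sketch builds the successive differences for the tabulated y's of the example above; the function name `difference_table` and the rounding to ten decimals (used only to suppress floating-point noise) are choices made for illustration.

```python
def difference_table(ys):
    """Return [ys, first differences, second differences, ...]."""
    table = [list(ys)]
    while len(table[-1]) > 1:
        prev = table[-1]
        table.append([round(b - a, 10) for a, b in zip(prev, prev[1:])])
    return table


ys = [0, 0.212, 0.463, 0.772, 1.153, 1.625, 2.207, 2.917, 3.776, 4.798, 6.001]
table = difference_table(ys)
third = table[3]
print(third)

# Spread of the third differences: small compared with the y's themselves,
# which is what suggests stopping at the cubic term.
print(max(third) - min(third))
```

How small the spread must be before the differences count as "nearly constant" is a matter of judgment that depends on the accuracy of the data.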

172. Smoothing of Observational and Experimental Data. Some- 
times it may be inconvenient or practically impossible to obtain an 
empirical formula to represent a set of observations or measurements. In
such cases the observations or measurements should be plotted on squared 
coordinate paper as usual. Then if it is known that the function under 
consideration is continuous, or if the plotted points seem to follow some 
law, a smooth curve should be drawn which will be a good compromise 
for all the points but not necessarily passing through any of them. 
Ordinates to this curve can then be measured at any point on the hori- 
zontal axis. 

If the observations or measurements have been made for equidistant 
values of the independent variable, a better graph can be obtained by 
first correcting or smoothing the observations before plotting them. Prob- 
ably the simplest and easiest method of smoothing is that due to Carl 
Runge and will now be explained. 









Case I. Straight-line graphs and graphs with small curvature. When
the plotted points seem to lie approximately on a straight line or on a
curve of such small curvature that any three consecutive points lie
approximately on a straight line, we correct the ordinate of the middle point of
the three by replacing the graph of the function over the interval $x_{i-1}$ to
$x_{i+1}$ by a straight line.

Let $\bar{y}$ denote the ordinate to the approximating line, let $h = x_i - x_{i-1}$
$= x_{i+1} - x_i$, and let $u$ be a new variable with origin at $x_i$ such that

(1)  $u = \dfrac{x - x_i}{h}.$

Then $u = -1, 0, 1$ when $x = x_{i-1}, x_i, x_{i+1}$.

The equation of the approximating line may be written

(2)  $\bar{y} = a_0 + a_1 u.$

The residuals for the line are then $\bar{y} - y$ or $a_0 + a_1 u - y$; and in order
that the line fit the data as closely as possible in the interval $x_{i-1}$ to $x_{i+1}$
the sum of the squares of the residuals in this interval must be a minimum.
Hence

$\sum_{u=-1}^{1} (a_0 + a_1 u - y)^2$

is to be a minimum. Then by Art. 167 we have

$\dfrac{\partial}{\partial a_0} \sum_{-1}^{1} (a_0 + a_1 u - y)^2 = 2\sum_{-1}^{1} (a_0 + a_1 u - y) = 0,$

$\dfrac{\partial}{\partial a_1} \sum_{-1}^{1} (a_0 + a_1 u - y)^2 = 2\sum_{-1}^{1} (a_0 + a_1 u - y)u = 0.$

From the first of the above equations we get

$a_0 - a_1 - y_{i-1} + a_0 - y_i + a_0 + a_1 - y_{i+1} = 0,$

from which

$a_0 = \dfrac{y_{i-1} + y_i + y_{i+1}}{3},$

or

(3)  $\bar{y}_i = \dfrac{y_{i-1} + y_i + y_{i+1}}{3}.$

The corrected ordinate at $x_i$ is thus the mean of the ordinate at $x_i$ and
the two adjacent ordinates. Since we are interested only in $\bar{y}_i$, we make
no use of the second of the above minimum equations.



On adding and subtracting $y_i$ in the right-hand member of (3), we get

$\bar{y}_i = y_i + \dfrac{y_{i-1} - 2y_i + y_{i+1}}{3},$

or

(4)  $\bar{y}_i = y_i + \dfrac{1}{3}\Delta^2 y_{i-1},$  by Art. 16.

This equation is more convenient for use than (3).

Case II. Graphs with large curvature. When the curvature of the
graph is so large that the graph cannot be approximated by a straight
line in a given interval, we replace the graph by a vertical parabola passing
through five consecutive points whose abscissas are $x_{i-2}, x_{i-1}, x_i, x_{i+1}, x_{i+2}$.
Using a new variable $u$ with origin at $x_i$ as in Case I, we may write the
equation of the parabola in the form

$\bar{y} = a_0 + a_1 u + a_2 u^2.$

Then $u = -2, -1, 0, 1, 2$ when $x = x_{i-2}, x_{i-1}, x_i, x_{i+1}, x_{i+2}$.

The residuals are $a_0 + a_1 u + a_2 u^2 - y$; and in order that the parabola
fit the data as closely as possible,

$\sum_{u=-2}^{2} (a_0 + a_1 u + a_2 u^2 - y)^2$

must be a minimum. Equating to zero the partial derivatives of this with
respect to $a_0$, $a_1$, $a_2$, we get

$\sum_{-2}^{2} (a_0 + a_1 u + a_2 u^2 - y) = 0,$

$\sum_{-2}^{2} (a_0 + a_1 u + a_2 u^2 - y)u = 0,$

$\sum_{-2}^{2} (a_0 + a_1 u + a_2 u^2 - y)u^2 = 0,$

or

$5a_0 + a_1 \sum u + a_2 \sum u^2 = \sum y,$

$a_0 \sum u + a_1 \sum u^2 + a_2 \sum u^3 = \sum uy,$

$a_0 \sum u^2 + a_1 \sum u^3 + a_2 \sum u^4 = \sum u^2 y,$

all sums being taken from $u = -2$ to $u = 2$.

Since $\sum u^2 = 10$, $\sum u^4 = 34$, and $\sum u = 0$, $\sum u^3 = 0$, the above equations
reduce to

$5a_0 + 10a_2 = y_{i-2} + y_{i-1} + y_i + y_{i+1} + y_{i+2},$

$10a_1 = -2y_{i-2} - y_{i-1} + y_{i+1} + 2y_{i+2},$

$10a_0 + 34a_2 = 4y_{i-2} + y_{i-1} + y_{i+1} + 4y_{i+2}.$

Since we are interested only in $a_0$, we eliminate $a_2$ between the first and
third of these equations and thereby obtain

$a_0 = \dfrac{-3y_{i-2} + 12y_{i-1} + 17y_i + 12y_{i+1} - 3y_{i+2}}{35}.$

Now adding and subtracting $y_i$ in the right-hand member, we get

$a_0 = y_i - \dfrac{3}{35}\left(y_{i-2} - 4y_{i-1} + 6y_i - 4y_{i+1} + y_{i+2}\right)$

$\quad = y_i - \dfrac{3}{35}\Delta^4 y_{i-2},$  by Art. 16.

Hence we have

(5)  $\bar{y}_i = y_i - \dfrac{3}{35}\Delta^4 y_{i-2}$

as the corrected ordinate at the point $x_i$.

Formulas (4) and (5) are the smoothing formulas for the two cases
considered. They may be applied as many times as necessary, or until
the corrections become negligible in comparison with the y's.

It is to be noted that (4) will not correct the first and last observations 
of a set and that (5) will not correct the first two and last two observations. 

Extensive tables of smoothing formulas of the type herein considered 
can be found in Whittaker and Robinson’s Calculus of Observations, pp. 
295-296 
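
The two smoothing formulas translate directly into code. The sketch below is a minimal Python illustration; the function names `smooth3` and `smooth5` are assumptions made here. Formula (4) replaces each interior ordinate by itself plus one-third of the second difference centered on it, and formula (5) subtracts 3/35 of the centered fourth difference. End ordinates that the formulas cannot correct are returned unchanged.

```python
def smooth3(ys):
    """Formula (4): three-point smoothing; first and last values are kept."""
    out = list(ys)
    for i in range(1, len(ys) - 1):
        second_diff = ys[i - 1] - 2 * ys[i] + ys[i + 1]
        out[i] = ys[i] + second_diff / 3
    return out


def smooth5(ys):
    """Formula (5): five-point smoothing; first two and last two are kept."""
    out = list(ys)
    for i in range(2, len(ys) - 2):
        fourth_diff = (ys[i - 2] - 4 * ys[i - 1] + 6 * ys[i]
                       - 4 * ys[i + 1] + ys[i + 2])
        out[i] = ys[i] - 3 * fourth_diff / 35
    return out


# Made-up ordinates, for illustration only.
ys = [1.0, 2.2, 2.9, 4.1, 5.0, 5.8, 7.1]
once = smooth3(ys)
twice = smooth3(once)   # a second application of the same formula
print(once)
print(twice)
print(smooth5(ys))
```

Each pass reads only the values from the previous pass, so applying a formula repeatedly is just a matter of feeding the output back in.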


Example 1. When the following data are plotted on squared paper,
the plotted points are seen to lie approximately on a curve of small
curvature. Hence the y's can be smoothed by formula (4). Two applications
of the formula are made for purposes of illustration, the results of
the first and second corrections being denoted by $y'$ and $y''$, respectively.

It will be seen from the above table that the corrections are much
smaller in the second application of the smoothing formula. This is
generally the case.

Example 2. When the following observations are plotted on squared
paper, they are seen to lie approximately on a curve of considerable
curvature. Hence we smooth them by formula (5). We make two applications
of the formula, as before.

The corrections in this case are seen to be much smaller in the second 
application of the smoothing formula. 










































































































































































































EXERCISES XIX

1. Find by the method of averages a formula of the form $y = ax^n$
which will fit the following data:

x | 280 | 295 | 312 | 330 | 355 | 370
y | 32.3 | 37.6 | 44.9 | 53.3 | 67.2 | 76.1

2. Plot on logarithmic paper the data of the above example and find 
a and n graphically or from selected points. 

3. Find by the method of least squares a formula of the form
$y = a + bx^2$ which will fit the following data:

x | 20 | 24 | 29 | 36 | 43
y | 2100 | 2980 | 4310 | 6600 | 9360

4. The data in the following table can be fitted by a formula of the
type $y = ax^n$. Find the formula by the method of averages.

x | 55 | 25 | 14 | 7 | 4 | 3 | 2
y | 6.7 | 15.6 | 28.9 | 60.4 | 110 | 149 | (illegible)


5. The data given below can be fitted by an exponential formula of the
type $y = ae^{bx}$. Plot the data on semilogarithmic paper and find values
for $a$ and $b$.

x | 2.5 | 5.0 | 7.5 | 10.0 | 12.5 | 15.0
y | 76 | 52 | 36 | 24 | 16 | 11


6. Solve the preceding example by the method of averages. 


7. Find by the method of least squares a formula of the type
$y = a + bx^2$ which will fit the following data:

x | 8.1 | 12.0 | 16.2 | 22.5 | 33.0 | 41.4
y | 0.20 | 0.44 | 0.79 | 1.53 | 3.30 | 5.20

8. The data in the table below can be fitted by a formula of the type
$x/y = a + bx$. Find the formula by the method of averages.

9. Work the preceding example by plotting the points $(x, x/y)$ on
ordinary coordinate paper and finding the values of $a$ and $b$.

Hint: Put $x/y = u$. Then the equation becomes $u = a + bx$, the graph
of which is a straight line.

10. In Exercise 3 put $x^2 = t$ and plot the equation $y = a + bt$. Find
from the graph the approximate values of $a$ and $b$ and then find corrections
to these values by the general method of Art. 169.





11. Find by the method of averages a polynomial formula which will
fit the data in the following table:

x | 1.5 | 3.0 | 4.5 | 6.0 | 7.5 | 9.0 | 10.5 | 12.0 | 13.5 | 15.0
y | 14 | 18 | 26 | 42 | 69 | 112 | 174 | 259 | 370 | 512

12. The data in the table below are to be fitted by a formula having
$y = 20$ as an asymptote. Find the formula by any method.

x | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10
y | 84.9 | 79.9 | 75.0 | 70.7 | 67.2 | 64.3 | 61.9 | 59.9 | 57.6 | 55.6 | 53.4


13. The table below gives the atmospheric refraction for a star, at
various altitudes above the horizon. Assume that $R = a/(b + \tan h)$,
omit the first and last values in the table, and find $a$ and $b$ by the method
of least squares.


h 

6* 

D 

D 

6 * 



o 

O 

40“ 

e 

S 

' 

90* 

R 

34 ' 50 ' 

18 ' 06 ' 

11 ' 37 ' 

8 ' 23 ' 

6 ' 29 ' 

5'16^ 




D 


14. The following points lie approximately on a straight line. Smooth 
the y’s by two applications of formula (4) and draw the line 


1 

2 

3 

4 

6 

8 

7 

8 

9 

10 

11 

13 

13 

M 

16 

18 

1 17 

18 

19 

30 

1 8 

2 0 

|3 6 

3.8 

4 3 

6.1 

5.6 

6.6 

8.8 

7.8 

8 1 

8.8 

0.6 

9.8 

lo.e 

1 11 1 

1 

1 117 

1 

12 5 

12.9 

13.7 


15. Smooth the y’s in the following set by three applications of formula 
(5): 


s 

1 

3 

8 

4 

6 

8 

7 

8 

9 

10 

11 

13 

13 

14 

16 

18 

17 

9 

18.6 

11.0 

D 

m 

D 

m 

D 

D 

9.1 

9.4 

9.6 

9 8 

9.8 

9.1 

8.6 

7.8 

87 























CHAPTER XIX 


HARMONIC ANALYSIS OF EMPIRICAL FUNCTIONS 

173. Introduction. Any periodic function can be represented by a
trigonometric series of the form

(1)  $y = a_0 + a_1 \cos x + a_2 \cos 2x + \cdots + a_n \cos nx$
     $\quad + b_1 \sin x + b_2 \sin 2x + \cdots + b_n \sin nx.$

This function is periodic and has the period $2\pi$. A periodic function
having a period different from $2\pi$ can be reduced to the form (1) by a
suitable change of the independent variable (Art. 176).

When we wish to find an empirical formula to represent a phenomenon
that is known to be periodic (such, for example, as the tides, alternating
currents and voltages, mean monthly temperatures, etc.), we should always
assume a formula of the type (1). If the values of the function are
known for certain equidistant values of the independent variable (from
readings of an instrument, measurements of a graph, or otherwise), it is
an easy matter to find the unknown constants $a_0, a_1, \ldots, a_n, b_1, b_2, \ldots, b_n$.

In the present chapter we shall give explicit formulas for computing these
coefficients when the number of equally spaced ordinates is either 12 or 24.
We shall also give schemes for reducing the numerical work to a minimum.

174. Case of 12 Ordinates. We assume that the period of the unknown
function is $2\pi$ and that the value of the function is known for 12
equidistant values of the independent variable. The appropriate formula is
then

(174.1)  $y = a_0 + a_1 \cos x + a_2 \cos 2x + a_3 \cos 3x + a_4 \cos 4x$
         $\quad + a_5 \cos 5x + a_6 \cos 6x + b_1 \sin x + b_2 \sin 2x$
         $\quad + b_3 \sin 3x + b_4 \sin 4x + b_5 \sin 5x.$

Let the corresponding values of $x$ and $y$ be as given in the table below.

x | 0° | 30° | 60° | 90° | 120° | 150° | 180° | 210° | 240° | 270° | 300° | 330°
y | $y_0$ | $y_1$ | $y_2$ | $y_3$ | $y_4$ | $y_5$ | $y_6$ | $y_7$ | $y_8$ | $y_9$ | $y_{10}$ | $y_{11}$

Then on substituting m (174.1) each of these corresponding sets of 
values we obtain the following conditional equations: 


S64 




Art 171) 


CASE OF 12 ORDINATES 


yo — flo "f“ *4“ “f“ ^^4 “f" “4" “1“ ® ® ’ ^2 “f" ® ”1" ® ' ^4 "f" ®*^s# 

yi = Oo + + "g®* + 0 • Cs ^at — — ®e + 2 ^‘ 

+ ^* + + 4^/ > 

.1 1 1.1.. V^r . V3i 

y 2 — Oo + -gOl 2^2 Os gO* + "g®# + ®« T • 

I A I. vli V3i 

+ 0-6, —b,— —b,, 

— do “I” 0 • di — 0,2 “1“ 0 * da “4“ ^4 "1“ ® ”4“ “I” ® 

- - 63 + 0 ■ ^4 + 65 j 

_ 1 1 I 1 1 I ^ j V3- 

^4 — flo — -g fli — -2^2 + fla 2^4 — "gfla + "T g 

JL(\ h ^ 

- -^6. + 0 • 6. + > 

ye = flo — " 2 ®* "i" ® ' ®* — ~2^* — ®* “t” " 2 ^* 

— ^62 + 63 — -^- 6 «+ l^e, 

3 3 3 

ye ~ flo — 0>i "f“ ^2 — ^^3 “4" ®4 — da "4“ de 0 ‘ ^1 “4“ ® ’ ^2 
”4" 0 * ^3 "4“ 0 * ^4 ”1” 0 ’ ^8 y 

yt = ®o — " 2*2 + 0 ■ ®s — " 2 ^* *** 2 

^ 6 — 6, + ^ 6» , 

^ 2 * 2 ‘ 2 *’ 

1 I . 1 1 « 

y, = Oo - - gtti — — Oi + a, — - a« — gO» +^o* --Oi 

+ Vlj, + V\+y3j., 

2 33 

yg = tto -)- ^ * ®i — O2 "}■ 0 Oj -|- 0(4 -|- 0 • O5 — (Ig — 61 -f- 0 bt 
-j- 63 -|- 0 • 64 — 65 , 


.1 1 1,1, V3. V3. 

yio — Oo + -gOi — -g a 2 — Oj — 2 ' ’2®* "> ®* 

+„. 4 . + :^». + ^»., 



666 HARMONIC ANALYSIS OF EMPIRICAL FUNCTIONS [Chap. XIX 

I V3 , 1 , n 1 V3 VT, 

yii = Oo + -^ai + -Oj + 0 • o, — -a« j-a, — c, — -bi ^b. 



To solve these equations for the a's and b's we apply the rule of Art.
167 for writing down normal equations. Thus, to find $a_0$ we multiply
each equation by the coefficient of $a_0$ in that equation and add the results.
We then get

$12a_0 = y_0 + y_1 + y_2 + y_3 + y_4 + y_5 + y_6 + y_7 + y_8 + y_9 + y_{10} + y_{11},$

which gives $a_0$ explicitly in terms of the known quantities $y_0, y_1, \ldots, y_{11}$.

To find $a_1$ we multiply each equation by the coefficient of $a_1$ in that
equation and add the results. This gives *

$6a_1 = y_0 + \tfrac{\sqrt{3}}{2}y_1 + \tfrac{1}{2}y_2 - \tfrac{1}{2}y_4 - \tfrac{\sqrt{3}}{2}y_5 - y_6 - \tfrac{\sqrt{3}}{2}y_7 - \tfrac{1}{2}y_8 + \tfrac{1}{2}y_{10} + \tfrac{\sqrt{3}}{2}y_{11}.$

Continuing in this manner, we get the following equations for finding
the remaining a's and b's:

* The reason for the disappearance of all the a's and b's except one in the normal
equations is as follows:

Since the multipliers used in obtaining the normal equations are sines and cosines,
the coefficients of the a's and b's in the resulting normal equations are all of some
one of the forms

$\sum_r \sin px_r, \quad \sum_r \cos qx_r, \quad \sum_r \sin px_r \sin qx_r, \quad \sum_r \sin px_r \cos qx_r, \quad \sum_r \cos px_r \cos qx_r, \quad \sum_r \sin^2 px_r, \quad \sum_r \cos^2 qx_r,$

where $r$ takes the values $0, 1, 2, \ldots, (m - 1)$, and $m$ is the number of equidistant
ordinates. But

$\sum_r \sin px_r = 0, \quad \sum_r \cos qx_r = 0, \quad \sum_r \sin px_r \cos qx_r = 0,$

$\sum_r \sin px_r \sin qx_r = 0, \quad \sum_r \cos px_r \cos qx_r = 0 \quad (p \neq q),$

$\sum_r \sin^2 px_r = \dfrac{m}{2}, \quad \sum_r \cos^2 qx_r = \dfrac{m}{2}.$

Since only one of the a's or b's in each normal equation has a coefficient of the form
$\sum_r \sin^2 px_r$ or $\sum_r \cos^2 qx_r$, it is evident that all but one must disappear.

For a simple and elegant proof of the relations given above, the reader is referred
to Runge and König's Numerisches Rechnen, page 212.







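$6a_2 = y_0 + \tfrac{1}{2}y_1 - \tfrac{1}{2}y_2 - y_3 - \tfrac{1}{2}y_4 + \tfrac{1}{2}y_5 + y_6 + \tfrac{1}{2}y_7 - \tfrac{1}{2}y_8 - y_9 - \tfrac{1}{2}y_{10} + \tfrac{1}{2}y_{11},$

$6a_3 = y_0 - y_2 + y_4 - y_6 + y_8 - y_{10},$

$6a_4 = y_0 - \tfrac{1}{2}y_1 - \tfrac{1}{2}y_2 + y_3 - \tfrac{1}{2}y_4 - \tfrac{1}{2}y_5 + y_6 - \tfrac{1}{2}y_7 - \tfrac{1}{2}y_8 + y_9 - \tfrac{1}{2}y_{10} - \tfrac{1}{2}y_{11},$

$6a_5 = y_0 - \tfrac{\sqrt{3}}{2}y_1 + \tfrac{1}{2}y_2 - \tfrac{1}{2}y_4 + \tfrac{\sqrt{3}}{2}y_5 - y_6 + \tfrac{\sqrt{3}}{2}y_7 - \tfrac{1}{2}y_8 + \tfrac{1}{2}y_{10} - \tfrac{\sqrt{3}}{2}y_{11},$

$12a_6 = y_0 - y_1 + y_2 - y_3 + y_4 - y_5 + y_6 - y_7 + y_8 - y_9 + y_{10} - y_{11},$

$6b_1 = \tfrac{1}{2}y_1 + \tfrac{\sqrt{3}}{2}y_2 + y_3 + \tfrac{\sqrt{3}}{2}y_4 + \tfrac{1}{2}y_5 - \tfrac{1}{2}y_7 - \tfrac{\sqrt{3}}{2}y_8 - y_9 - \tfrac{\sqrt{3}}{2}y_{10} - \tfrac{1}{2}y_{11},$

$6b_2 = \tfrac{\sqrt{3}}{2}(y_1 + y_2 - y_4 - y_5 + y_7 + y_8 - y_{10} - y_{11}),$

$6b_3 = y_1 - y_3 + y_5 - y_7 + y_9 - y_{11},$

$6b_4 = \tfrac{\sqrt{3}}{2}(y_1 - y_2 + y_4 - y_5 + y_7 - y_8 + y_{10} - y_{11}),$

$6b_5 = \tfrac{1}{2}y_1 - \tfrac{\sqrt{3}}{2}y_2 + y_3 - \tfrac{\sqrt{3}}{2}y_4 + \tfrac{1}{2}y_5 - \tfrac{1}{2}y_7 + \tfrac{\sqrt{3}}{2}y_8 - y_9 + \tfrac{\sqrt{3}}{2}y_{10} - \tfrac{1}{2}y_{11}.$

We could find the values of the a's and b's directly from these equations,
but it would be a tedious process on account of the large number of terms
in the right-hand members. We therefore reduce the number of terms
on the right by grouping terms and substituting new variables for the
different groups. The first grouping gives

$12a_0 = (y_0 + y_6) + (y_1 + y_{11}) + (y_2 + y_{10}) + (y_3 + y_9) + (y_4 + y_8) + (y_5 + y_7),$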



568 


HARMONIC ANALYSIS OF EMPIRKWL FI NOTIONS IChap XIX 


6ai — (jfo ye) + H" y«) + "g (y* + yw) — "I (y4 + y») 

— + yOj 

60* = (y, + y,) 4 - i(yi 4. y^,) _ -^(y^ -I- yj,) _ (y, 4. y,) 

— -|(y« + ys) + -|(y» + yr), 

ба, = (yo — ye) — (y, + yio) + (y4 + y.), 

= (yo + y«) "2^^^ 

— ■|(y 4 + y*) — |(ys + yT), 

60, = (yo — ye) — ^(yi 1- y»>) + |(y2 + yio) — yo) 

+ ^(y* + yr), 

I2ae = (yo + ye) — (yi + yu) + (yo 4 - yio) — (ys + y») + {yt + ye) 

— (y* + y-i), 

6^1 = ■|(yi— y»i) + ^(yo— yio) + (ys—y*) + ^^^(y*—y») 

+ -|(y»— yr), 

^ 3 

бб, = -^[(yi — yii) + (yo — yio) — (y4 — y.) — (y* — yo)], 

66, = (yi — yn) — (y, — y,) 4 (y, -y,), 

66«= -^[(yi— yii) — (yo— yio) + (y4 — yo) — (y* — yi)], 

66, = -| (y, — yu) — ^lo) + (y* — yo) — ^ (yo — yo) 

+ |(yo— yr). 



Let us now put

$y_0 + y_6 = u_0, \qquad y_0 - y_6 = v_0,$
$y_1 + y_{11} = u_1, \qquad y_1 - y_{11} = v_1,$
$y_2 + y_{10} = u_2, \qquad y_2 - y_{10} = v_2,$
$y_3 + y_9 = u_3, \qquad y_3 - y_9 = v_3,$
$y_4 + y_8 = u_4, \qquad y_4 - y_8 = v_4,$
$y_5 + y_7 = u_5, \qquad y_5 - y_7 = v_5.$

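Then the normal equations become

$12a_0 = u_0 + u_1 + u_2 + u_3 + u_4 + u_5 = (u_0 + u_3) + (u_1 + u_5) + (u_2 + u_4),$

$6a_1 = v_0 + \tfrac{\sqrt{3}}{2}u_1 + \tfrac{1}{2}u_2 - \tfrac{1}{2}u_4 - \tfrac{\sqrt{3}}{2}u_5 = v_0 + \tfrac{\sqrt{3}}{2}(u_1 - u_5) + \tfrac{1}{2}(u_2 - u_4),$

$6a_2 = u_0 + \tfrac{1}{2}u_1 - \tfrac{1}{2}u_2 - u_3 - \tfrac{1}{2}u_4 + \tfrac{1}{2}u_5 = (u_0 - u_3) + \tfrac{1}{2}(u_1 + u_5) - \tfrac{1}{2}(u_2 + u_4),$

$6a_3 = v_0 - u_2 + u_4 = v_0 - (u_2 - u_4),$

$6a_4 = u_0 - \tfrac{1}{2}u_1 - \tfrac{1}{2}u_2 + u_3 - \tfrac{1}{2}u_4 - \tfrac{1}{2}u_5 = (u_0 + u_3) - \tfrac{1}{2}(u_1 + u_5) - \tfrac{1}{2}(u_2 + u_4),$

$6a_5 = v_0 - \tfrac{\sqrt{3}}{2}u_1 + \tfrac{1}{2}u_2 - \tfrac{1}{2}u_4 + \tfrac{\sqrt{3}}{2}u_5 = v_0 - \tfrac{\sqrt{3}}{2}(u_1 - u_5) + \tfrac{1}{2}(u_2 - u_4),$

$12a_6 = u_0 - u_1 + u_2 - u_3 + u_4 - u_5 = (u_0 - u_3) - (u_1 + u_5) + (u_2 + u_4),$

$6b_1 = \tfrac{1}{2}v_1 + \tfrac{\sqrt{3}}{2}v_2 + v_3 + \tfrac{\sqrt{3}}{2}v_4 + \tfrac{1}{2}v_5 = \tfrac{1}{2}(v_1 + v_5) + \tfrac{\sqrt{3}}{2}(v_2 + v_4) + v_3,$

$6b_2 = \tfrac{\sqrt{3}}{2}(v_1 + v_2 - v_4 - v_5) = \tfrac{\sqrt{3}}{2}\left[(v_1 - v_5) + (v_2 - v_4)\right],$

$6b_3 = v_1 - v_3 + v_5 = (v_1 + v_5) - v_3,$

$6b_4 = \tfrac{\sqrt{3}}{2}(v_1 - v_2 + v_4 - v_5) = \tfrac{\sqrt{3}}{2}\left[(v_1 - v_5) - (v_2 - v_4)\right],$

$6b_5 = \tfrac{1}{2}v_1 - \tfrac{\sqrt{3}}{2}v_2 + v_3 - \tfrac{\sqrt{3}}{2}v_4 + \tfrac{1}{2}v_5 = \tfrac{1}{2}(v_1 + v_5) - \tfrac{\sqrt{3}}{2}(v_2 + v_4) + v_3.$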

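If we make the further substitutions

$u_0 + u_3 = r_0, \quad u_0 - u_3 = s_0, \quad v_1 + v_5 = p_1, \quad v_1 - v_5 = q_1,$
$u_1 + u_5 = r_1, \quad u_1 - u_5 = s_1, \quad v_2 + v_4 = p_2, \quad v_2 - v_4 = q_2,$
$u_2 + u_4 = r_2, \quad u_2 - u_4 = s_2,$

the normal equations take the simpler forms

$12a_0 = r_0 + r_1 + r_2 = r_0 + (r_1 + r_2),$

$6a_1 = v_0 + \tfrac{\sqrt{3}}{2}s_1 + \tfrac{1}{2}s_2,$

$6a_2 = s_0 + \tfrac{1}{2}r_1 - \tfrac{1}{2}r_2 = s_0 + \tfrac{1}{2}(r_1 - r_2),$

$6a_3 = v_0 - s_2,$

$6a_4 = r_0 - \tfrac{1}{2}r_1 - \tfrac{1}{2}r_2 = r_0 - \tfrac{1}{2}(r_1 + r_2),$

$6a_5 = v_0 - \tfrac{\sqrt{3}}{2}s_1 + \tfrac{1}{2}s_2,$

$12a_6 = s_0 - r_1 + r_2 = s_0 - (r_1 - r_2),$

$6b_1 = \tfrac{1}{2}p_1 + \tfrac{\sqrt{3}}{2}p_2 + v_3 = v_3 + \tfrac{1}{2}p_1 + \tfrac{\sqrt{3}}{2}p_2,$

$6b_2 = \tfrac{\sqrt{3}}{2}(q_1 + q_2),$

$6b_3 = p_1 - v_3,$

$6b_4 = \tfrac{\sqrt{3}}{2}(q_1 - q_2),$

$6b_5 = \tfrac{1}{2}p_1 - \tfrac{\sqrt{3}}{2}p_2 + v_3 = v_3 + \tfrac{1}{2}p_1 - \tfrac{\sqrt{3}}{2}p_2.$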


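Finally, we write

$r_1 + r_2 = l, \qquad q_1 + q_2 = g,$
$r_1 - r_2 = m, \qquad q_1 - q_2 = h.$

Then the equations for finding the coefficients in the trigonometric series
are

(174.2)
$a_0 = \tfrac{1}{12}(r_0 + l),$
$a_1 = \tfrac{1}{6}\left(v_0 + \tfrac{\sqrt{3}}{2}s_1 + \tfrac{1}{2}s_2\right),$
$a_2 = \tfrac{1}{6}\left(s_0 + \tfrac{1}{2}m\right),$
$a_3 = \tfrac{1}{6}(v_0 - s_2),$
$a_4 = \tfrac{1}{6}\left(r_0 - \tfrac{1}{2}l\right),$
$a_5 = \tfrac{1}{6}\left(v_0 - \tfrac{\sqrt{3}}{2}s_1 + \tfrac{1}{2}s_2\right),$
$a_6 = \tfrac{1}{12}(s_0 - m),$
$b_1 = \tfrac{1}{6}\left(v_3 + \tfrac{1}{2}p_1 + \tfrac{\sqrt{3}}{2}p_2\right),$
$b_2 = \tfrac{\sqrt{3}}{12}g,$
$b_3 = \tfrac{1}{6}(p_1 - v_3),$
$b_4 = \tfrac{\sqrt{3}}{12}h,$
$b_5 = \tfrac{1}{6}\left(v_3 + \tfrac{1}{2}p_1 - \tfrac{\sqrt{3}}{2}p_2\right).$

The several substitutions made above can be accomplished very simply
by the addition and subtraction scheme given below, starting with the
given y's.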


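        | $y_0$ | $y_1$ | $y_2$ | $y_3$ | $y_4$ | $y_5$
        | $y_6$ | $y_{11}$ | $y_{10}$ | $y_9$ | $y_8$ | $y_7$
Sum     | $u_0$ | $u_1$ | $u_2$ | $u_3$ | $u_4$ | $u_5$
Diff.   | $v_0$ | $v_1$ | $v_2$ | $v_3$ | $v_4$ | $v_5$

        | $u_0$ | $u_1$ | $u_2$ |         | $v_1$ | $v_2$
        | $u_3$ | $u_5$ | $u_4$ |         | $v_5$ | $v_4$
Sum     | $r_0$ | $r_1$ | $r_2$ | Sum     | $p_1$ | $p_2$
Diff.   | $s_0$ | $s_1$ | $s_2$ | Diff.   | $q_1$ | $q_2$

        | $r_1$ |         | $q_1$
        | $r_2$ |         | $q_2$
Sum     | $l$   | Sum     | $g$
Diff.   | $m$   | Diff.   | $h$

The quantities $v_0$, $v_3$, and $r_0$ are printed in heavy type because they are
somewhat isolated from the other quantities which appear in the final
formulas for the coefficients.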


Check formulas. Since the chances of making an error in the additions
and subtractions are considerable, it is important to have a reliable check
on the computed a's and b's. As a check on the a's we have from the first
conditional equation

$y_0 = a_0 + a_1 + a_2 + a_3 + a_4 + a_5 + a_6.$

To find a check for the b's we subtract the twelfth conditional equation
from the second, giving

$y_1 - y_{11} = b_1 + \sqrt{3}\,b_2 + 2b_3 + \sqrt{3}\,b_4 + b_5,$

or, since

$v_1 = y_1 - y_{11},$

$v_1 = b_1 + b_5 + 2b_3 + \sqrt{3}(b_2 + b_4).$

The check formulas are therefore

(174.3)
$\sum a = y_0,$
$(b_1 + b_5) + 2b_3 + \sqrt{3}(b_2 + b_4) = v_1.$


We shall now work an example to show the application of the above 
scheme. 


Example 1. Find an empirical formula to fit the following data: 



  x     0°    30°    60°    90°   120°   150°   180°   210°   240°   270°   300°   330°
  y    9.3   15.0   17.4   23.0   37.0   31.0   15.3    4.0   -8.0  -13.2  -14.2   -6.0


Solution. The first part of the computation is carried out according to 
the scheme given above and should be self-explanatory. 

               0      1      2      3      4      5
y's           9.3   15.0   17.4   23.0   37.0   31.0
             15.3   -6.0  -14.2  -13.2   -8.0    4.0
Sum (u)      24.6    9.0    3.2    9.8   29.0   35.0
Diff. (v)    -6.0   21.0   31.6   36.2   45.0   27.0

               0      1      2                    1      2
u's          24.6    9.0    3.2      v's        21.0   31.6
              9.8   35.0   29.0                 27.0   45.0
Sum (r)      34.4   44.0   32.2      Sum (p)    48.0   76.6
Diff. (s)    14.8  -26.0  -25.8      Diff. (q)  -6.0  -13.4

r's          44.0      q's     -6.0
             32.2             -13.4
Sum      l = 76.2      g =    -19.4
Diff.    m = 11.8      h =      7.4


Now substituting these quantities in equations (174.2), we get

\[
\begin{aligned}
a_0 &= \tfrac{1}{12}(34.4 + 76.2) = 9.22,\\
a_1 &= \tfrac16\left(-6.0 - 26.0\,\tfrac{\sqrt3}{2} - 12.9\right) = -6.90,\\
a_2 &= \tfrac16 (14.8 + 5.9) = 3.45,\\
a_3 &= \tfrac16 (-6.0 + 25.8) = 3.30,\\
a_4 &= \tfrac16 (34.4 - 38.1) = -0.62,\\
a_5 &= \tfrac16\left(-6.0 + 26.0\,\tfrac{\sqrt3}{2} - 12.9\right) = 0.60,\\
a_6 &= \tfrac{1}{12}(14.8 - 11.8) = 0.25,\\
b_1 &= \tfrac16 (36.2 + 24.0 + 66.3) = 21.09,\\
b_2 &= \tfrac{\sqrt3}{12}(-19.4) = -2.80,\\
b_3 &= \tfrac16 (48.0 - 36.2) = 1.97,\\
b_4 &= \tfrac{\sqrt3}{12}(7.4) = 1.07,\\
b_5 &= \tfrac16 (36.2 + 24.0 - 66.3) = -1.02.
\end{aligned}
\]

Applying the check formulas (174.3), we have

\[
\begin{aligned}
&\textstyle\sum a = 9.30 = y_0 ,\\
&(b_1 + b_5) + 2b_3 + \sqrt3\,(b_2 + b_4) = 21.01 = v_1 .
\end{aligned}
\]

The coefficients are therefore correct and the final formula is

\[
\begin{aligned}
y = {}& 9.22 - 6.90\cos x + 3.45\cos 2x + 3.30\cos 3x - 0.62\cos 4x\\
 & + 0.60\cos 5x + 0.25\cos 6x + 21.09\sin x - 2.80\sin 2x\\
 & + 1.97\sin 3x + 1.07\sin 4x - 1.02\sin 5x.
\end{aligned}
\]
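
As an independent check on Example 1, the final formula can be evaluated at the twelve tabulated abscissas and compared with the data, and the b-check of (174.3) can be recomputed. The sketch below assumes the rounded coefficients printed above; the names `A`, `B`, `DATA`, `fitted`, and `b_check` are illustrative choices, not notation from the text.

```python
import math

# Rounded coefficients of Example 1 (a0..a6 and b1..b5).
A = [9.22, -6.90, 3.45, 3.30, -0.62, 0.60, 0.25]
B = [21.09, -2.80, 1.97, 1.07, -1.02]

# Ordinates of Example 1 at x = 0, 30, ..., 330 degrees.
DATA = [9.3, 15.0, 17.4, 23.0, 37.0, 31.0, 15.3, 4.0, -8.0, -13.2, -14.2, -6.0]


def fitted(x_deg):
    """Evaluate the trigonometric formula at x given in degrees."""
    x = math.radians(x_deg)
    cos_part = sum(a * math.cos(j * x) for j, a in enumerate(A))
    sin_part = sum(b * math.sin(j * x) for j, b in enumerate(B, start=1))
    return cos_part + sin_part


# Largest disagreement between the formula and the data at the ordinates.
worst = max(abs(fitted(30 * k) - yk) for k, yk in enumerate(DATA))

# Left-hand side of the b-check in (174.3); it should be close to v1 = 21.0.
b_check = (B[0] + B[4]) + 2 * B[2] + math.sqrt(3) * (B[1] + B[3])

print(f"largest residual at the ordinates: {worst:.3f}")
print(f"b-check: {b_check:.2f}")
```

Because the coefficients are rounded to two decimal places, the residuals are small but not exactly zero; with unrounded coefficients the formula would pass through all twelve points.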

Note. Since the terms of a trigonometric series are additive, it is 
necessary that the coefficients all be computed to the same number of 
decimal places (Art. 7). 

175. Case of 24 Ordinates. For 24 equally-spaced ordinates the values
of x are taken at equal intervals of 15° apart from 0° to 345° inclusive.
The appropriate formula for this case is

\[
\begin{aligned}
y = {}& a_0 + a_1\cos x + a_2\cos 2x + a_3\cos 3x + a_4\cos 4x + a_5\cos 5x\\
 & + a_6\cos 6x + a_7\cos 7x + a_8\cos 8x + a_9\cos 9x + a_{10}\cos 10x\\
 & + a_{11}\cos 11x + a_{12}\cos 12x + b_1\sin x + b_2\sin 2x + b_3\sin 3x\\
 & + b_4\sin 4x + b_5\sin 5x + b_6\sin 6x + b_7\sin 7x + b_8\sin 8x\\
 & + b_9\sin 9x + b_{10}\sin 10x + b_{11}\sin 11x.
\end{aligned}
\tag{175.1}
\]


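\[
\begin{array}{c|ccccccccccccc}
x & 0° & 15° & 30° & 45° & 60° & 75° & 90° & 105° & 120° & 135° & 150° & 165° & 180°\\ \hline
y & y_0 & y_1 & y_2 & y_3 & y_4 & y_5 & y_6 & y_7 & y_8 & y_9 & y_{10} & y_{11} & y_{12}
\end{array}
\]

\[
\begin{array}{c|ccccccccccc}
x & 195° & 210° & 225° & 240° & 255° & 270° & 285° & 300° & 315° & 330° & 345°\\ \hline
y & y_{13} & y_{14} & y_{15} & y_{16} & y_{17} & y_{18} & y_{19} & y_{20} & y_{21} & y_{22} & y_{23}
\end{array}
\]
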


Let the corresponding values of x and y be as given in the table above.
Then on substituting in (175.1) these corresponding values of x and y
we get 24 conditional equations. Applying to these the rule for obtaining
normal equations, we get 24 equations in which the a’s and b’s are given
explicitly in terms of the y’s. Then we group the terms in the right-hand
members, substitute new variables for the different groups, group again,
etc., just as in the case of 12 ordinates. The final formulas for computing
the a’s and b’s are found to be as follows:


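\[
\begin{aligned}
a_0 &= \tfrac{1}{24}(l_0 + e),\\
a_1 &= \tfrac{1}{12}\left(v_0 + C s_1 + \tfrac{\sqrt3}{2} s_2 + \tfrac{1}{\sqrt2} s_3 + \tfrac12 s_4 + S s_5\right),\\
a_2 &= \tfrac{1}{12}\left(s_0 + \tfrac{\sqrt3}{2} m_1 + \tfrac12 m_2\right),\\
a_3 &= \tfrac{1}{12}\left(v_0 - s_4 + \tfrac{1}{\sqrt2}(s_1 - s_3 - s_5)\right),\\
a_4 &= \tfrac{1}{12}\left(m_0 + \tfrac12 f\right),\\
a_5 &= \tfrac{1}{12}\left(v_0 + S s_1 - \tfrac{\sqrt3}{2} s_2 - \tfrac{1}{\sqrt2} s_3 + \tfrac12 s_4 + C s_5\right),\\
a_6 &= \tfrac{1}{12}(s_0 - m_2),\\
a_7 &= \tfrac{1}{12}\left(v_0 - S s_1 - \tfrac{\sqrt3}{2} s_2 + \tfrac{1}{\sqrt2} s_3 + \tfrac12 s_4 - C s_5\right),\\
a_8 &= \tfrac{1}{12}\left(l_0 - \tfrac12 e\right),\\
a_9 &= \tfrac{1}{12}\left(v_0 - s_4 - \tfrac{1}{\sqrt2}(s_1 - s_3 - s_5)\right),\\
a_{10} &= \tfrac{1}{12}\left(s_0 - \tfrac{\sqrt3}{2} m_1 + \tfrac12 m_2\right),\\
a_{11} &= \tfrac{1}{12}\left(v_0 - C s_1 + \tfrac{\sqrt3}{2} s_2 - \tfrac{1}{\sqrt2} s_3 + \tfrac12 s_4 - S s_5\right),\\
a_{12} &= \tfrac{1}{24}(m_0 - f),\\
b_1 &= \tfrac{1}{12}\left(S p_1 + \tfrac12 p_2 + \tfrac{1}{\sqrt2} p_3 + \tfrac{\sqrt3}{2} p_4 + C p_5 + v_6\right),\\
b_2 &= \tfrac{1}{12}\left(\tfrac12 g_1 + \tfrac{\sqrt3}{2} g_2 + q_3\right),\\
b_3 &= \tfrac{1}{12}\left(p_2 - v_6 + \tfrac{1}{\sqrt2}(p_1 + p_3 - p_5)\right),\\
b_4 &= \tfrac{\sqrt3}{24}\, c,\\
b_5 &= \tfrac{1}{12}\left(C p_1 + \tfrac12 p_2 - \tfrac{1}{\sqrt2} p_3 - \tfrac{\sqrt3}{2} p_4 + S p_5 + v_6\right),\\
b_6 &= \tfrac{1}{12}(g_1 - q_3),\\
b_7 &= \tfrac{1}{12}\left(C p_1 - \tfrac12 p_2 - \tfrac{1}{\sqrt2} p_3 + \tfrac{\sqrt3}{2} p_4 + S p_5 - v_6\right),\\
b_8 &= \tfrac{\sqrt3}{24}\, d,\\
b_9 &= \tfrac{1}{12}\left(v_6 - p_2 + \tfrac{1}{\sqrt2}(p_1 + p_3 - p_5)\right),\\
b_{10} &= \tfrac{1}{12}\left(\tfrac12 g_1 - \tfrac{\sqrt3}{2} g_2 + q_3\right),\\
b_{11} &= \tfrac{1}{12}\left(S p_1 - \tfrac12 p_2 + \tfrac{1}{\sqrt2} p_3 - \tfrac{\sqrt3}{2} p_4 + C p_5 - v_6\right),
\end{aligned}
\tag{175.2}
\]

where C = cos 15° = 0.9659258, S = sin 15° = 0.2588190, and the other
quantities are obtained from the given y’s according to the following
scheme: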


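\[
\begin{array}{l|cccccccccccc}
 & y_0 & y_1 & y_2 & y_3 & y_4 & y_5 & y_6 & y_7 & y_8 & y_9 & y_{10} & y_{11}\\
 & y_{12} & y_{23} & y_{22} & y_{21} & y_{20} & y_{19} & y_{18} & y_{17} & y_{16} & y_{15} & y_{14} & y_{13}\\ \hline
\text{Sum} & u_0 & u_1 & u_2 & u_3 & u_4 & u_5 & u_6 & u_7 & u_8 & u_9 & u_{10} & u_{11}\\
\text{Diff.} & \mathbf{v_0} & v_1 & v_2 & v_3 & v_4 & v_5 & \mathbf{v_6} & v_7 & v_8 & v_9 & v_{10} & v_{11}
\end{array}
\]

\[
\begin{array}{l|cccccc}
 & u_0 & u_1 & u_2 & u_3 & u_4 & u_5\\
 & u_6 & u_{11} & u_{10} & u_9 & u_8 & u_7\\ \hline
\text{Sum} & r_0 & r_1 & r_2 & r_3 & r_4 & r_5\\
\text{Diff.} & s_0 & s_1 & s_2 & s_3 & s_4 & s_5
\end{array}
\qquad
\begin{array}{l|ccccc}
 & v_1 & v_2 & v_3 & v_4 & v_5\\
 & v_{11} & v_{10} & v_9 & v_8 & v_7\\ \hline
\text{Sum} & p_1 & p_2 & p_3 & p_4 & p_5\\
\text{Diff.} & q_1 & q_2 & \mathbf{q_3} & q_4 & q_5
\end{array}
\]

\[
\begin{array}{l|ccc}
 & r_0 & r_1 & r_2\\
 & r_3 & r_5 & r_4\\ \hline
\text{Sum} & l_0 & l_1 & l_2\\
\text{Diff.} & m_0 & m_1 & m_2
\end{array}
\qquad
\begin{array}{l|cc}
 & q_1 & q_2\\
 & q_5 & q_4\\ \hline
\text{Sum} & g_1 & g_2\\
\text{Diff.} & h_1 & h_2
\end{array}
\qquad
\begin{array}{l|c}
 & l_1\\
 & l_2\\ \hline
\text{Sum} & e\\
\text{Diff.} & f
\end{array}
\qquad
\begin{array}{l|c}
 & h_1\\
 & h_2\\ \hline
\text{Sum} & c\\
\text{Diff.} & d
\end{array}
\]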



Here the quantities $v_0$, $v_6$, and $q_3$ are printed in heavy type because
they are somewhat isolated from the other quantities which appear in the
final formulas for the coefficients.

A check formula for the a’s is given by the first conditional equation,
and is

\[ \textstyle\sum a = y_0 . \]

To find a check formula for the b’s we subtract the 24th conditional
equation from the second and obtain

\[
\begin{aligned}
y_1 - y_{23} = v_1 = {}& 2S(b_1 + b_{11}) + (b_2 + b_{10}) + \sqrt2\,(b_3 + b_9)\\
 & + \sqrt3\,(b_4 + b_8) + 2C(b_5 + b_7) + 2b_6 .
\end{aligned}
\]

The check formulas are therefore

\[
\left\{
\begin{aligned}
&\textstyle\sum a = y_0 ,\\
&2S(b_1 + b_{11}) + (b_2 + b_{10}) + \sqrt2\,(b_3 + b_9) + \sqrt3\,(b_4 + b_8)\\
&\qquad + 2C(b_5 + b_7) + 2b_6 = v_1 .
\end{aligned}
\right.
\tag{175.3}
\]
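
The b-check for 24 ordinates is simply twice the sum of the sine terms evaluated at x = 15°. A minimal sketch is given below; the function name `b_check_24` and the convention that `b[j-1]` holds $b_j$ are assumptions made for illustration. The sample values are the rounded b's found in Example 2 of this article.

```python
import math


def b_check_24(b):
    """Left-hand side of the b-check in (175.3).

    b is a list of eleven numbers with b[j-1] = b_j for j = 1..11.
    """
    if len(b) != 11:
        raise ValueError("expected eleven sine coefficients")
    S = math.sin(math.radians(15))
    C = math.cos(math.radians(15))
    return (2 * S * (b[0] + b[10]) + (b[1] + b[9])
            + math.sqrt(2) * (b[2] + b[8]) + math.sqrt(3) * (b[3] + b[7])
            + 2 * C * (b[4] + b[6]) + 2 * b[5])


# Rounded sine coefficients b1..b11 of Example 2.
b_example = [-12.779, -16.625, -0.323, 1.516, 1.462, -2.583,
             0.322, -0.216, 0.677, -0.459, -0.640]

# The same quantity written directly as 2 * sum of b_j * sin(15 j degrees).
direct = 2 * sum(bj * math.sin(math.radians(15 * j))
                 for j, bj in enumerate(b_example, start=1))

print(round(b_check_24(b_example), 3), round(direct, 3))
```

The two printed numbers agree, and both should be close to $v_1 = y_1 - y_{23}$ of the data being fitted.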


Example 2. Find an empirical formula to fit the data in the following
table:

  x     0°   15°   30°   45°   60°   75°   90°  105°  120°  135°  150°  165°  180°
  y    149   137   128   126   128   135   159   178   189   191   189   187   178

  x   195°  210°  225°  240°  255°  270°  285°  300°  315°  330°  345°
  y    170   177   183   181   179   179   185   182   176   166   160


Solution. The preliminary quantities are found by the scheme below: 

              0     1     2     3     4     5     6     7     8     9    10    11
y's         149   137   128   126   128   135   159   178   189   191   189   187
            178   160   166   176   182   185   179   179   181   183   177   170
Sum (u)     327   297   294   302   310   320   338   357   370   374   366   357
Diff. (v)   -29   -23   -38   -50   -54   -50   -20    -1     8     8    12    17

              0     1     2     3     4     5
u's         327   297   294   302   310   320
            338   357   366   374   370   357
Sum (r)     665   654   660   676   680   677
Diff. (s)   -11   -60   -72   -72   -60   -37

              1     2     3     4     5
v's         -23   -38   -50   -54   -50
             17    12     8     8    -1
Sum (p)      -6   -26   -42   -46   -51
Diff. (q)   -40   -50   -58   -62   -49

              0     1     2                   1     2
r's         665   654   660      q's        -40   -50
            676   677   680                 -49   -62
Sum (l)    1341  1331  1340      Sum (g)    -89  -112
Diff. (m)   -11   -23   -20      Diff. (h)    9    12

l's        1331      h's       9
           1340               12
Sum     e = 2671     c =      21
Diff.   f =   -9     d =      -3


Now substituting these quantities in (175.2), we find

  a_0 = 167.167,   a_1 = -19.983,   a_2 = -3.410,   a_3 = 5.471,    a_4 = -1.292,
  a_5 = 0.250,     a_6 = 0.750,     a_7 = 0.309,    a_8 = 0.458,    a_9 = -0.304,
  a_10 = -0.090,   a_11 = -0.243,   a_12 = -0.083,

  b_1 = -12.779,   b_2 = -16.625,   b_3 = -0.323,   b_4 = 1.516,
  b_5 = 1.462,     b_6 = -2.583,    b_7 = 0.322,    b_8 = -0.216,
  b_9 = 0.677,     b_10 = -0.459,   b_11 = -0.640.



The check formulas (175.3) give

\[
\begin{aligned}
&\textstyle\sum a = 149.000 = y_0 ,\\
&2S(b_1 + b_{11}) + (b_2 + b_{10}) + \sqrt2\,(b_3 + b_9) + \sqrt3\,(b_4 + b_8) + 2C(b_5 + b_7)\\
&\qquad + 2b_6 = -22.997 = v_1 ,
\end{aligned}
\]

practically.

Hence the required formula is

\[
\begin{aligned}
y = {}& 167.167 - 19.983\cos x - 3.410\cos 2x + 5.471\cos 3x\\
 & - 1.292\cos 4x + 0.250\cos 5x + 0.750\cos 6x + 0.309\cos 7x\\
 & + 0.458\cos 8x - 0.304\cos 9x - 0.090\cos 10x - 0.243\cos 11x\\
 & - 0.083\cos 12x - 12.779\sin x - 16.625\sin 2x - 0.323\sin 3x\\
 & + 1.516\sin 4x + 1.462\sin 5x - 2.583\sin 6x + 0.322\sin 7x\\
 & - 0.216\sin 8x + 0.677\sin 9x - 0.459\sin 10x - 0.640\sin 11x.
\end{aligned}
\]
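
The coefficients of Example 2 can also be cross-checked without the folding scheme, by summing the products directly as the normal equations prescribe. The sketch below does this for any even number of equally spaced ordinates over one period; the function name `harmonic_direct` and the constant `EXAMPLE_2` are illustrative assumptions. It swaps the text's sum-and-difference scheme for plain summation, which is slower by hand but convenient on a machine.

```python
import math


def harmonic_direct(y):
    """Return ([a0..an], [b1..b(n-1)]) for N = 2n equally spaced ordinates.

    y[k] is the ordinate at x = 360*k/N degrees. For N = 24 this gives
    a0..a12 and b1..b11, the coefficients of formula (175.1).
    """
    N = len(y)
    if N == 0 or N % 2:
        raise ValueError("an even, non-zero number of ordinates is expected")
    n = N // 2
    theta = [2 * math.pi * k / N for k in range(N)]
    a = [2 / N * sum(yk * math.cos(j * t) for yk, t in zip(y, theta))
         for j in range(n + 1)]
    a[0] /= 2   # the constant term carries the divisor N, not N/2
    a[n] /= 2   # so does the highest cosine term
    b = [2 / N * sum(yk * math.sin(j * t) for yk, t in zip(y, theta))
         for j in range(1, n)]
    return a, b


# Ordinates y0..y23 of Example 2.
EXAMPLE_2 = [149, 137, 128, 126, 128, 135, 159, 178, 189, 191, 189, 187,
             178, 170, 177, 183, 181, 179, 179, 185, 182, 176, 166, 160]

a, b = harmonic_direct(EXAMPLE_2)
print([round(x, 3) for x in a])
print([round(x, 3) for x in b])
```

The printed lists can be compared entry by entry with the a's and b's found above.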



176. Periods Other than 2π. When a function is periodic and has a
period different from 2π, we change the independent variable by a linear
substitution. Thus, if x is the independent variable and the given
function is y = f(x), we write

(1)    x = k + mθ.

If the limits for x are g and h and we wish the limits of θ to be 0 and
2π, we have only to substitute in (1) these corresponding values of
x and θ and then solve the resulting equations for k and m. Hence in
this case we have from (1)

    g = k + 0, or k = g;

and

    h = k + 2πm = g + 2πm.

Hence m = (h − g)/2π, and the desired formula of transformation is

\[
x = g + \frac{(h - g)\,\theta}{2\pi}, \qquad\text{or}\qquad \theta = \frac{2\pi (x - g)}{h - g}.
\tag{176.1}
\]

In all these cases the proper formula to assume for y is

\[
\begin{aligned}
y = {}& a_0 + a_1\cos\theta + a_2\cos 2\theta + \cdots + a_n\cos n\theta\\
 & + b_1\sin\theta + b_2\sin 2\theta + \cdots + b_{n-1}\sin (n-1)\theta.
\end{aligned}
\tag{176.2}
\]

For example, if the period of a phenomenon is known to be 18.3 days
and we wish to use 12 equidistant ordinates, the values of x corresponding
to these ordinates would be x_0 = 0, x_1 = 18.3/12 = 1.525, x_2 = 3.050, etc.
The corresponding values of θ would be 0°, 30°, 60°, etc. The values of
the a’s and b’s in (176.2) would be found by substituting in (176.2)
these values of θ and the corresponding y’s, or simply by applying the
12-ordinate scheme to the given y’s. The resulting formula in terms of x
would then be, by (176.1) and (176.2),

\[
y = a_0 + a_1\cos\left(\frac{2\pi x}{18.3}\right) + a_2\cos 2\left(\frac{2\pi x}{18.3}\right) + \cdots .
\tag{2}
\]
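
The change of variable (176.1) and the evaluation of (176.2) in terms of x can be sketched as follows. The function names `x_to_theta` and `evaluate` are hypothetical, chosen here only for illustration; `a` holds $a_0, a_1, \dots$ and `b` holds $b_1, b_2, \dots$.

```python
import math


def x_to_theta(x, g, h):
    """Map x, where [g, h] spans one full period, to theta in radians (176.1)."""
    if h == g:
        raise ValueError("the period h - g must be non-zero")
    return 2 * math.pi * (x - g) / (h - g)


def evaluate(a, b, x, g, h):
    """Evaluate formula (176.2) at x for coefficient lists a and b."""
    theta = x_to_theta(x, g, h)
    cos_part = sum(aj * math.cos(j * theta) for j, aj in enumerate(a))
    sin_part = sum(bj * math.sin(j * theta) for j, bj in enumerate(b, start=1))
    return cos_part + sin_part


# Period of 18.3 days with 12 ordinates: x1 = 18.3/12 corresponds to 30 degrees.
print(round(math.degrees(x_to_theta(18.3 / 12, 0.0, 18.3)), 6))
```

With `g = 0` the mapping reduces to θ = 2πx/(period), which is the argument appearing in formula (2).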

Example. The equation of time for twelve equidistant intervals in a 
certain year is given in the following table. Taking the period of this 
phenomenon to be 365.2 days, find an empirical formula that will give its 
value at any instant in that year. 



Partial Solution. Here θ = 2πx/365.2. The solution of the
problem is simplified by reducing the given values of y to seconds of time.
We then have the following table:

  θ      0°     30°     60°     90°    120°    150°
  y    190.9   810.4   740.6   233.6  -182.7  -142.6

  θ    180°    210°    240°    270°    300°    330°
  y    222.0   660.0     9.8  -613.8  -978.2  -669.6


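Treating the y’s according to the scheme of Art. 174 and substituting
the results into equations (174.2), we find

  a_0 = 14.99,  a_1 = -47.32,  a_2 = 212.96,  a_3 = 5.62,  a_4 = -6.82,
  a_5 = 26.26,  a_6 = -14.68,

  b_1 = 426.39,  b_2 = 36.34,  b_3 = -11.67,  b_4 = 587.94,  b_5 = -14.35.

The required formula is therefore

\[
\begin{aligned}
y = {}& 14.99 - 47.32\cos\left(\frac{2\pi x}{365.2}\right) + 212.96\cos 2\left(\frac{2\pi x}{365.2}\right)\\
 & + 5.62\cos 3\left(\frac{2\pi x}{365.2}\right) - 6.82\cos 4\left(\frac{2\pi x}{365.2}\right)\\
 & + 26.26\cos 5\left(\frac{2\pi x}{365.2}\right) - 14.68\cos 6\left(\frac{2\pi x}{365.2}\right)\\
 & + 426.39\sin\left(\frac{2\pi x}{365.2}\right) + 36.34\sin 2\left(\frac{2\pi x}{365.2}\right)
   - 11.67\sin 3\left(\frac{2\pi x}{365.2}\right)\\
 & + 587.94\sin 4\left(\frac{2\pi x}{365.2}\right) - 14.35\sin 5\left(\frac{2\pi x}{365.2}\right),
\end{aligned}
\]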


where y is in seconds of time.

The reader should note that when the successive values of x are
substituted into the expression 2πx/365.2, the results are 30°, 60°, etc. Thus,

\[
x_1 = \frac{365.2}{12}, \qquad
\frac{2\pi x_1}{365.2} = \frac{2\pi}{365.2}\cdot\frac{365.2}{12} = \frac{\pi}{6}\ \text{radians} = 30°, \ \text{etc.}
\]




Caution in the Use of Empirical Formulas. Empirical formulas are 
really interpolation formulas of particular forms, and are therefore sub- 
ject to all the limitations of interpolation formulas. They can be relied 
upon for all values of the independent variable within the range of values 
used in determining the coefficients, but should not be trusted outside of 
these limits, except possibly for very short distances outside the range of 
values used. Stated otherwise, empirical formulas may be used for inter- 
polation but not for extrapolation. 

If, however, the given function is known to have a certain form for all 
values of the independent variable, we may use the formula for computing 
rough values of the function outside the range of values used in deter- 
mining the coefficients. 


EXERCISES XX

1. Find a periodic function that will fit the following data. 


X 

0* 

0 

O 

CO 

60* 

90* 

i 

0 

150* 


210* 

240* 

270* 

300“ 1 330“ 

1 

y 

38 4 

1 

1 

1 11 8 

1 

4 3 

13 8 

3 9 

-18 1 

-22 9 

-27 2 

-23 8 

8 2 

31 7 1 34 2 

1 


2. Do the same for the following:

  x     0°   15°   30°   45°   60°   75°   90°  105°  120°  135°  150°  165°  180°
  y     45   110   142   128   138    88    -2   -12   -25   -39   -21   -38   -69

  x   195°  210°  225°  240°  255°  270°  285°  300°  315°  330°  345°
  y    -78   -90  -112   -92   -70   -45    25    68    59    40    54


3. Solve completely the Example of Art. 176.

4. The period of a certain phenomenon is 14.4 days. Twenty-four
values for equal time intervals are given below. Find an empirical formula
to represent this phenomenon.

2.4, 5.6, 6.7, 7.4, 8.8, 9.9, 10.4, 12.0, 13.8, 14.9, 16.4, 16.8, 17.5,
18.4, 19.2, 20.8, 21.4, 20.5, 18.5, 16.0, 15.1, 14.8, 12.2, 6.4.











APPENDIX 

VALUES OF THE PROBABILITY INTEGRAL

\[ P = \frac{2}{\sqrt{\pi}} \int_0^{t} e^{-t^2}\,dt, \quad \text{where } t = hx. \]

 hx      0       1     2     3     4     5     6     7     8     9

0.00  0.00000 00113 00226 00339 00451 00564 00677 00790 00903 01016
0.01  0.01128 01241 01354 01467 01580 01692 01805 01918 02031 02144
0.02  0.02256 02369 02482 02595 02708 02820 02933 03046 03159 03271
0.03  0.03384 03497 03610 03722 03835 03948 04060 04173 04286 04398
0.04  0.04511 04624 04736 04849 04962 05074 05187 05299 05412 05525

0.05  0.05637 05750 05862 05975 06087 06200 06312 06425 06537 06650
0.06  0.06762 06875 06987 07099 07212 07324 07437 07549 07661 07773
0.07  0.07886 07998 08110 08223 08335 08447 08559 08671 08784 08896
0.08  0.09008 09120 09232 09344 09456 09568 09680 09792 09904 10016
0.09  0.10128 10240 10352 10464 10576 10687 10799 10911 11023 11135

0.10  0.11246 11358 11470 11581 11693 11805 11916 12028 12139 12251
0.11  0.12362 12474 12585 12697 12808 12919 13031 13142 13253 13365
0.12  0.13476 13587 13698 13809 13921 14032 14143 14254 14365 14476
0.13  0.14587 14698 14809 14919 15030 15141 15252 15363 15473 15584
0.14  0.15695 15805 15916 16027 16137 16248 16358 16468 16579 16689

0.15  0.16800 16910 17020 17130 17241 17351 17461 17571 17681 17791
0.16  0.17901 18011 18121 18231 18341 18451 18560 18670 18780 18890
0.17  0.18999 19109 19218 19328 19437 19547 19656 19766 19875 19984
0.18  0.20094 20203 20312 20421 20530 20639 20748 20857 20966 21075
0.19  0.21184 21293 21402 21510 21619 21728 21836 21945 22053 22162

0.20  0.22270 22379 22487 22595 22704 22812 22920 23028 23136 23244
0.21  0.23352 23460 23568 23676 23784 23891 23999 24107 24214 24322
0.22  0.24430 24537 24645 24752 24859 24967 25074 25181 25288 25395
0.23  0.25502 25609 25716 25823 25930 26037 26144 26250 26357 26463
0.24  0.26570 26677 26783 26889 26996 27102 27208 27314 27421 27527

0.25  0.27633 27739 27845 27950 28056 28162 28268 28373 28479 28584
0.26  0.28690 28795 28901 29006 29111 29217 29322 29427 29532 29637
0.27  0.29742 29847 29952 30056 30161 30266 30370 30475 30579 30684
0.28  0.30788 30892 30997 31101 31205 31309 31413 31517 31621 31725
0.29  0.31828 31932 32036 32139 32243 32346 32450 32553 32656 32760

0.30  0.32863 32966 33069 33172 33275 33378 33480 33583 33686 33788
0.31  0.33891 33993 34096 34198 34300 34403 34505 34607 34709 34811
0.32  0.34913 35014 35116 35218 35319 35421 35523 35624 35725 35827
0.33  0.35928 36029 36130 36231 36332 36433 36534 36635 36735 36836
0.34  0.36936 37037 37137 37238 37338 37438 37538 37638 37738 37838

0.35  0.37938 38038 38138 38237 38337 38436 38536 38635 38735 38834
0.36  0.38933 39032 39131 39230 39329 39428 39526 39625 39724 39822
0.37  0.39921 40019 40117 40215 40314 40412 40510 40608 40705 40803
0.38  0.40901 40999 41096 41194 41291 41388 41486 41583 41680 41777
0.39  0.41874 41971 42068 42164 42261 42358 42454 42550 42647 42743

0.40  0.42839 42935 43031 43127 43223 43319 43415 43510 43606 43701
0.41  0.43797 43892 43988 44083 44178 44273 44368 44463 44557 44652
0.42  0.44747 44841 44936 45030 45124 45219 45313 45407 45501 45595
0.43  0.45689 45782 45876 45970 46063 46157 46250 46343 46436 46529
0.44  0.46623 46715 46808 46901 46994 47086 47179 47271 47364 47456

0.45  0.47548 47640 47732 47824 47916 48008 48100 48191 48283 48374
0.46  0.48466 48557 48648 48739 48830 48921 49012 49103 49193 49284
0.47  0.49375 49465 49555 49646 49736 49826 49916 50006 50096 50185
0.48  0.50275 50365 50454 50543 50633 50722 50811 50900 50989 51078
0.49  0.51167 51256 51344 51433 51521 51609 51698 51786 51874 51962







 hx      0       1     2     3     4     5     6     7     8     9

0.50 0.52050 52138 52226 52313 52401 52488 52576 52663 52750 52837 
0.51 0.52924 53011 53098 53185 53272 53358 53445 53531 53617 53704 
0.52 0.53790 53876 53962 54048 54134 54219 54305 54390 54476 54561 
0.53 0.54646 54732 54817 54902 54987 55071 55156 55241 55325 55410 
0.54 0.55494 55578 55662 55746 55830 55914 55998 56082 56165 56249 

0.55 0.56332 56416 56499 56582 56665 56748 56831 56914 56996 57079 
0.56 0.57162 57244 57326 57409 57491 57573 57655 57737 57818 57900 
0.57 0.57982 58063 58144 58226 58307 58388 58469 58550 58631 58712 
0.58 0.58792 58873 58953 59034 59114 59194 59274 59354 59434 59514 
0.59 0.59594 59673 59753 59832 59912 59991 60070 60149 60228 60307 

0.60 0.60386 60464 60543 60621 60700 60778 60856 60934 61012 61090 
0.61 0.61168 61246 61323 61401 61478 61556 61633 61710 61787 61864 
0.62 0.61941 62018 62095 62171 62248 62324 62400 62477 62553 62629 
0.63 0.62705 62780 62856 62932 63007 63083 63158 63233 63309 63384 
0.64 0.63459 63533 63608 63683 63757 63832 63906 63981 64055 64129 

0.65 0.64203 64277 64351 64424 64498 64572 64645 64718 64791 64865 
0.66 0.64938 65011 65083 65156 65229 65301 65374 65446 65519 65591 
0.67 0.65663 65735 65807 65878 65950 66022 66093 66165 66236 66307 
0.68 0.66378 66449 66520 66591 66662 66732 66803 66873 66944 67014 
0.69 0.67084 67154 67224 67294 67364 67433 67503 67572 67642 67711 

0.70 0.67780 67849 67918 67987 68056 68125 68193 68262 68330 68398 
0.71 0.68467 68535 68603 68671 68738 68806 68874 68941 69009 69076 
0.72 0.69143 69210 69278 69344 69411 69478 69545 69611 69678 69744 
0.73 0.69810 69877 69943 70009 70075 70140 70206 70272 70337 70403 
0.74 0.70468 70533 70598 70663 70728 70793 70858 70922 70987 71051 

0.75 0.71116 71180 71244 71308 71372 71436 71500 71563 71627 71690 
0.76 0.71754 71817 71880 71943 72006 72069 72132 72195 72257 72320 
0.77 0.72382 72444 72507 72569 72631 72693 72755 72816 72878 72940 
0.78 0.73001 73062 73124 73185 73246 73307 73368 73429 73489 73550 
0.79 0.73610 73671 73731 73791 73851 73911 73971 74031 74091 74151 

0.80 0.74210 74270 74329 74388 74447 74506 74565 74624 74683 74742 
0.81 0.74800 74859 74917 74976 75034 75092 75150 75208 75266 75323 
0.82 0.75381 75439 75496 75553 75611 75668 75725 75782 75839 75896 
0.83 0.75952 76009 76066 76122 76178 76234 76291 76347 76403 76459 
0.84 0.76514 76570 76626 76681 76736 76792 76847 76902 76957 77012 

0.85 0.77067 77122 77176 77231 77285 77340 77394 77448 77502 77556 
0.86 0.77610 77664 77718 77771 77825 77878 77932 77985 78038 78091 
0.87 0.78144 78197 78250 78302 78355 78408 78460 78512 78565 78617 
0.88 0.78669 78721 78773 78824 78876 78928 78979 79031 79082 79133 
0.89 0.79184 79235 79286 79337 79388 79439 79489 79540 79590 79641 

0.90 0.79691 79741 79791 79841 79891 79941 79990 80040 80090 80139 
0.91 0.80188 80238 80287 80336 80385 80434 80482 80531 80580 80628 
0.92 0.80677 80725 80773 80822 80870 80918 80966 81013 81061 81109 
0.93 0.81156 81204 81251 81299 81346 81393 81440 81487 81534 81580 
0.94 0.81627 81674 81720 81767 81813 81859 81905 81951 81997 82043 

0.95 0.82089 82135 82180 82226 82271 82317 82362 82407 82452 82497 
0.96 0.82542 82587 82632 82677 82721 82766 82810 82855 82899 82943 
0.97 0.82987 83031 83075 83119 83162 83206 83250 83293 83337 83380 
0.98 0.83423 83466 83509 83552 83595 83638 83681 83723 83766 83808 
0.99 0.83851 83893 83935 83977 84020 84061 84103 84145 84187 84229 



 hx      0       1     2     3     4     5     6     7     8     9

1.00  0.84270 84312 84353 84394 84435 84477 84518 84559 84600 84640
1.01  0.84681 84722 84762 84803 84843 84883 84924 84964 85004 85044
1.02  0.85084 85124 85163 85203 85243 85282 85322 85361 85400 85439
1.03  0.85478 85517 85556 85595 85634 85673 85711 85750 85788 85827
1.04  0.85865 85903 85941 85979 86017 86055 86093 86131 86169 86206

1.05  0.86244 86281 86318 86356 86393 86430 86467 86504 86541 86578
1.06  0.86614 86651 86688 86724 86760 86797 86833 86869 86905 86941
1.07  0.86977 87013 87049 87085 87120 87156 87191 87227 87262 87297
1.08  0.87333 87368 87403 87438 87473 87507 87542 87577 87611 87646
1.09  0.87680 87715 87749 87783 87817 87851 87885 87919 87953 87987

1.10  0.88021 88054 88088 88121 88155 88188 88221 88254 88287 88320
1.11  0.88353 88386 88419 88452 88484 88517 88549 88582 88614 88647
1.12  0.88679 88711 88743 88775 88807 88839 88871 88902 88934 88966
1.13  0.88997 89029 89060 89091 89122 89154 89185 89216 89247 89277
1.14  0.89308 89339 89370 89400 89431 89461 89492 89522 89552 89582

1.15  0.89612 89642 89672 89702 89732 89762 89792 89821 89851 89880
1.16  0.89910 89939 89968 89997 90027 90056 90085 90114 90142 90171
1.17  0.90200 90229 90257 90286 90314 90343 90371 90399 90428 90456
1.18  0.90484 90512 90540 90568 90595 90623 90651 90678 90706 90733
1.19  0.90761 90788 90815 90843 90870 90897 90924 90951 90978 91005

1.20  0.91031 91058 91085 91111 91138 91164 91191 91217 91243 91269
1.21  0.91296 91322 91348 91374 91399 91425 91451 91477 91502 91528
1.22  0.91553 91579 91604 91630 91655 91680 91705 91730 91755 91780
1.23  0.91805 91830 91855 91879 91904 91929 91953 91978 92002 92026
1.24  0.92051 92075 92099 92123 92147 92171 92195 92219 92243 92266

1.25  0.92290 92314 92337 92361 92384 92408 92431 92454 92477 92500
1.26  0.92524 92547 92570 92593 92615 92638 92661 92684 92706 92729
1.27  0.92751 92774 92796 92819 92841 92863 92885 92907 92929 92951
1.28  0.92973 92995 93017 93039 93061 93082 93104 93126 93147 93168
1.29  0.93190 93211 93232 93254 93275 93296 93317 93338 93359 93380

1.30  0.93401 93422 93442 93463 93484 93504 93525 93545 93566 93586
1.31  0.93606 93627 93647 93667 93687 93707 93727 93747 93767 93787
1.32  0.93807 93826 93846 93866 93885 93905 93924 93944 93963 93982
1.33  0.94002 94021 94040 94059 94078 94097 94116 94135 94154 94173
1.34  0.94191 94210 94229 94247 94266 94284 94303 94321 94340 94358

1.35  0.94376 94394 94413 94431 94449 94467 94485 94503 94521 94538
1.36  0.94556 94574 94592 94609 94627 94644 94662 94679 94697 94714
1.37  0.94731 94748 94766 94783 94800 94817 94834 94851 94868 94885
1.38  0.94902 94918 94935 94952 94968 94985 95002 95018 95035 95051
1.39  0.95067 95084 95100 95116 95132 95148 95165 95181 95197 95213

1.40  0.95229 95244 95260 95276 95292 95307 95323 95339 95354 95370
1.41  0.95385 95401 95416 95431 95447 95462 95477 95492 95507 95523
1.42  0.95538 95553 95568 95582 95597 95612 95627 95642 95656 95671
1.43  0.95686 95700 95715 95729 95744 95758 95773 95787 95801 95815
1.44  0.95830 95844 95858 95872 95886 95900 95914 95928 95942 95956

1.45  0.95970 95983 95997 96011 96024 96038 96051 96065 96078 96092
1.46  0.96105 96119 96132 96145 96159 96172 96185 96198 96211 96224
1.47  0.96237 96250 96263 96276 96289 96302 96315 96327 96340 96353
1.48  0.96365 96378 96391 96403 96416 96428 96440 96453 96465 96478
1.49  0.96490 96502 96514 96526 96539 96551 96563 96575 96587 96599







 hx      0       1     2     3     4     5     6     7     8     9

2.5   0.99959 99961 99963 99965 99967 99969 99971 99972 99974 99975
2.6   0.99976 99978 99979 99980 99981 99982 99983 99984 99985 99986
2.7   0.99987 99987 99988 99989 99989 99990 99991 99991 99992 99992
2.8   0.99992 99993 99993 99994 99994 99994 99995 99995 99995 99996
2.9   0.99996 99996 99996 99997 99997 99997 99997 99997 99997 99998
3.0   0.99998 99998 99998 99998 99998 99998 99998 99998 99999 99999
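
Individual entries of the table can be spot-checked by machine. The sketch below evaluates the probability integral by Simpson's Rule and compares it with Python's standard-library error function `math.erf`, which computes the same quantity; the function name `probability_integral` and the default number of strips are assumptions made for illustration.

```python
import math


def probability_integral(hx, steps=200):
    """Approximate P = (2/sqrt(pi)) * integral from 0 to hx of exp(-t^2) dt.

    Simpson's Rule with an even number of equal strips is used.
    """
    if steps <= 0 or steps % 2:
        raise ValueError("steps must be a positive even integer")
    width = hx / steps
    total = 1.0 + math.exp(-hx * hx)  # the two end ordinates
    for i in range(1, steps):
        t = i * width
        weight = 4 if i % 2 else 2    # odd interior points 4, even ones 2
        total += weight * math.exp(-t * t)
    return 2 / math.sqrt(math.pi) * total * width / 3


for hx in (0.50, 1.00):
    simpson = probability_integral(hx)
    print(f"hx = {hx:.2f}  Simpson = {simpson:.5f}  math.erf = {math.erf(hx):.5f}")
```

Both columns printed by the loop can be compared with the tabulated values for hx = 0.50 and hx = 1.00.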








INDEX 


(The numbers refer to pages) 


A 

Absolute error, 4 

Acceleration of gravity, formula for, 42 
Accidental errors, 460 
Accuracy in determination of arguments, 
28 

in evaluation of formulas, 24 
of addition, 10 
of averages, 11, 12 
of division, 16, 16 
of interpolation formulas, 102 
of linear interpolation, 112 
of logs and antilogs, 19 
of measurements, 493 
of multiplication, 14 
of powers and roots, 18 
of products and quotients, 14, 15, 16, 17 
of series approximations, 32 
of subtraction, 12, 13, 14 
of solution of difference equations, 405, 
406 

of differential equations, 367 
of systems of linear equations, 303 
Adams, J. C., 323, 327 
Addition, errors of, 10 
Adopted values of constants, 25 
Algebraic equations, special procedure 
for, 207 

Alternating series, error in, 34 
Analysis, harmonic, of empirical func- 
tions, 564 

Antilogarithms, accuracy of, 19, 20 
Approximate numbers, 2 
Arguments, accuracy in determination of, 
28 

Astronomy, practical, fundamental equations of, 43, 44 
Asymptotic series, 160, 475 
Average deviation, 499 
error, 490 

Averages, accuracy of, 11 
method of, 528 

B 

Backward interpolation, Newton’s form- 
ula for, 59 


Ballistic equations, 373, 374, 375 
Ballistics, fundamental equation of, 373 
Bairstow, L., 253 
Barker, J. E., 376 
Barker method, 376, 383 
how to use, 379 
Bashforth, F., 323 

Bessel’s formula of interpolation, 84, 85 
for interpolating to halves, 85 
symmetrical form of, 85 
when to use, 89 

Best type of empirical formula, finding, 
554 

Biermann, O., 54, 122 
Binomial series, remainder in, 35 
Block relaxation, 416 
Borel, E., 102 

Boundary-value problems, 438, 450 
Bradley, J., 491 
Brodetsky, S., 245 

C 

Cajori, F., 201 
Carvallo, 259 

Caution in use of empirical formulas, 581 
in use of quadrature formulas, 168 
Central-difference formulas, 
of interpolation, 79 
quadrature, 144, 148 

geometric significance of, 148 
remainder terms in, 189 
Charlier, C. V. L., 165, 167, 193 
Chauvenet, W., 476 

Check formula for coefficients in root- 
squared equations, 233 
Check formulas, for 12 ordinates, 672 
for 24 ordinates, 577 
Combination of sets of measurements, 
503, 504 

Complex roots, detection of, 234 

computation of, by Graeffe’s method, 
234 

Conditional equations, 564, 565 
Conditions for convergence of iteration 
process for algebraic and transcen- 
dental equations, 212, 223, 301 
of Picard’s method, 352 




Conformal transformation, 407 
Constants, adopted values of, 25 
Convergence, conditions for, 

in iteration process, 212, 223, 301 
in Picard’s method, 352 
Cramer, Gabriel, 266 
Cramer’s rule, 266, 268 
Criterion for negligible effects, 514 
Curve, Gaussian, 468 
Cubature, mechanical, 172 
formula for, 173 

general statement concerning, 174 
D 

Davids, Norman, 156 
Derivatives, nth, 38 

Derivatives and differences, relation be- 
tween, 54 

Detection of complex roots, 234 
Determinants, 
errors in, 30 

evaluation of numerical, 
by expansion in minors, 260 
by pivotal method, 261 
by triangular method, 263 
triangular, 263 
Deviation, average, 499 
standard, 499 

Diagonal differences and horizontal differences, relations between, 40 
Diagonal difference table, 48 
Difference equations, 394 
quotients, 392 
table, diagonal, 48, 60 
horizontal, 49, 60 
Differences, 48 

and derivatives, relations between, 64 

central, 79 

double, 121 

of a polynomial, 64 

two-way, 121, 122 

Differential equations, ordinary, numeri- 
cal solution of, 

by difference polynomials, 320 
by Euler’s method, 310 
by Milne’s method, 353 
by Picard’s method, 316 
by Runge-Kutta method, 358 
by Störmer method, 344 
partial, numerical solution of, 
by iteration, 396 
by relaxation, 410 
by Rayleigh-Ritz method, 422 


starting the solution of, 
by Euler method, 311 
by Milne’s formulas, 330 
by Runge-Kutta method, 369 
by Taylor series method, 328 
Differentiation, numerical, 133 

partial, of tabulated functions, 135 
Direct measurements, 493 
Divided differences, definition of, 65 
relation to simple differences, 68 
symmetry of, 67 
tables of, 66 
Division, accuracy of, 16 
Double differences, 121 
general formula for, 122 
Double interpolation, 

by repeated single interpolation, 114 
formula for, 124 

remainder term of, 124 

E 

Elliptic integrals, 30 
Emmons, H. W., 422 
Empirical formulas, 522 
caution in use of, 581 
finding best type of, 554 
finding constants in, 
by method of averages, 528 
by method of least squares, 533 
by plotting, 522 

when both variables are subject to 
error, 551 

when residuals are weighted, 542 
general case of non-linear formulas, 545 
Encke, J. F., 272 
Equal effects, method of, 513 
principle of, 26 

Equations, algebraic and transcendental, 
194 

location of roots of, 194, 196 
solution of, 
by interpolation, 197 
by iteration, 208, 219 
by Newton-Raphson method, 201, 215 
by repeated plotting on larger scale, 
199 

Equations, ballistic, 373-375 
difference, 394 
differential, ordinary, 310 
of exterior ballistics, 373-375 
of first order, 310-336 
of second order, 337-348 
special, of second order, 342-348 
simultaneous, 348-350 
partial, 391-436 





Equations, error, 468 
functional, 466 
heat conduction, 395 
integral, 437-459 
linear, 445-449 
non-linear, 450-459 
Laplace, 394 

linear systems, accuracy of solution of, 
303 

normal, 535 
Poisson, 395 
probability, 468 
residual, 534 

simultaneous linear, 260-309 
accuracy of solution of, 303-307 
numerical solution of, 
by determinants, 266-269 
by method of division by leading 
coefficients, 269-271 
by Gauss’s method, 272-277 
by inversion of matrices, 296-297 
by iteration, 297-300 
Error, average, 496 
effect of, in tabular value, 52 
equation, 468 
function, 464 

inherent in Gauss’s quadrature formula, 191 

in Simpson’s Rule, 183 
in Weddle’s Rule, 189 
in solution of difference equations, 
405 

mean square, 495 
probable, 495 

Errors, general formula for, 9 
in addition, 10 
in determinants, 39 
in division, 15 
in logarithms, 19 
in multiplication, 14 
in powers and roots, 18 
in solution of differential equations, 
307 

in solution of simultaneous linear 
equations, 303 
in subtraction, 12 
percentage, 4 

probability of, between given limits, 
462 

propagation of, 511 
relative, 4 
systematic, 460 

Euler’s method of solving differential 
equations numerically, 310 
modified, 312, 375 


Euler’s quadrature formula, 165 
inherent error in, 192 
remainder term in, 192 
Euler’s summation formula, 166 
Evaluation of determinants, 260 
of formulas, accuracy in, 24 
the two problems in, 24 
of probability integral, 473 
Exponential series, remainder in, 35, 36 

F 

False position, method of, 197 
Figures, significant, 2 
Finding best type of empirical formula, 
554 

Formulas, check, 572, 577 
empirical, 522 
Milne, 354 

non-linear empirical, 545 
Forward interpolation, formula for, 58 
Function, error, 464 
Green’s, 440 
table, 114 

Functional equation, 466 
Fundamental equation for errors in argu- 
ments, 29 

of exterior ballistics, 373 
G 

Gans, R., 391 

Gauss, C. F., 132, 152, 272 
Gaussian curve, 468 
Gauss’s interpolation formulas, 79-81 
backward formula, 80 
forward formula, 79 
third formula, 81 

Gauss’s method of solving simultaneous 
linear equations, 272-277 
quadrature formula, 152 
disadvantages of, 169 
inherent error in, 191 
Geometric significance of 
central-difference quadrature formulas, 
148 

Newton-Raphson method, 203, 204 
precision measures, 497 
Simpson’s Rule, 138 
Weddle’s Rule, 139 
Weierstrass’s theorems, 47 
Geometry of iteration, 210 
Goursat, E., 438, 450 
Goursat-Hedrick, 177 
Graeffe, 225 





Graeffe’s root-squaring method, 225
for complex roots, 234 
for equal roots, 243 
Brodetsky and Smeal’s improvement of,
245

extension of, by Carvallo, 259 
improving the accuracy of, 257
for real roots, 228 
principle of, 225 

Rainbow's check on coefficients in, 232 
Graphic method of determining constants 
in empirical formulas, 522 
of solving equations, 194 
Gravity, formula for acceleration of, 42 
Gregory, J., 58 
Green’s function, 440 
Grouping of equations, 530 

H 

Halves, formula for interpolating to, 85 
Halving the interval, 334 
Harmonic analysis of empirical func- 
tions, 564 

Heat-conduction equation, 395 
Hermite’s formula, 130
Horizontal difference table, 49 

I 

Index of precision for errors, 468
for residuals, 488 
Indirect measurements, 493 
the two fundamental problems of, 513 
Inherent error in Gauss’s quadrature 
formula, 191 

in Newton-Raphson method, 206
in prismoidal formula, 180 
in Simpson’s Rule, 183 
in solution of partial differential equa- 
tions by difference equations, 405 
in Weddle’s Rule, 189 
Integral equations, 317, 437 
linear, 445 
non-linear, 450 
Integrals, elliptic, 30 
Integrating ahead, formula for, 321 
Integration, numerical. See Numerical 
Integration 

Interpolation, definition of, 46 
accuracy of, 102 
backward, formula for, 59 
Bessel’s formulas for, 84, 85 
forward, formula for, 56
double, 114-130 
inverse, 93 

Lagrange’s, formula for, 74 


Newton’s general formula of, 70 
series, 102

to halves, formula for, 85 
trigonometric, formula for, 130 
Interval, halving the, of h, 334
Inverse interpolation, 93
by Lagrange's formula, 93 
by reversion of series, 96 
by successive approximations, 93
Iteration, method of, for finding roots,
208, 219 

for solving difference equations, 396
for solving integral equations, 317, 437 
for solving systems of linear equations, 
297 

process for algebraic and transcen- 
dental equations, 208, 219, 455 
convergence of, 211, 221, 301 
rule for, 302 
geometry of, 210

and relaxation methods compared, 420, 
421 

J 

Jahnke and Emde, 30 

K 

Kernel, of integral equation, 437 
Kooy, J. M. J., 379 
Kutta, W., 359 

L 

Lagrange’s formula of interpolation, 74 
remainder term in, 103, 104, 108 
uses of, 75 
Lattice points, 392 
Law of accidental errors, 460 
Law of error of a function, 468, 472
for residuals, 487 
Least squares, method of, 533, 543 
principle of, 481, 484 
Levinson, A., 156
Liebmann, H., 391 

Linear equations, solution of simultaneous
algebraic, 260-309
accuracy of solutions of, 303 
function, law of error of, 468, 472 
integral equations, 445 
interpolation, accuracy of, 112 
with several arguments, 124 
Lobatto, 159, 192 

Lobatto’s quadrature formula, 159 
application of in solution of integral 
equations, 458 





Logarithmic series, remainder in, 37 
paper, 525, 527, 555 
Logarithms, accuracy of, 10 
Lovitt, W. V., 450
Lowan, A. N., 150
Lüroth, J., 32

M 

Maclaurin’s series, remainder term in, 33 
Malmsten’s formula, 192
Matrices, addition and subtraction of, 278 
column, 278 
inversion of, 284-297
multiplication of, 279 
by a number or scalar, 279 
by another matrix, 280-283 
unscrambling of, 290, 293-295 
unit, 278 

Matrix, definition of, 277 
of coefficients, 280 
inverse, 284 
non-singular, 284 

Maxima and minima of tabulated func- 
tions, 134 

Mean, weighted, 485 
Mean square error, 494 
Measurements, direct, 493 
indirect, 493 
rejection of, 519 
Measures of precision, 494 
computation of, from residuals, 500-503 
geometric significance of, 497 
relations between, 490 
Mechanical quadrature, definition of, 136 
Mechanical cubature, 130, 172
general rule for, 174 
Membrane, vibrating, 430 
Method of averages, 528 

of Barker for starting trajectories, 376 
of equal effects, 513 
of Euler, 310 
of false position, 197 
of Graeffe, 225 

of interpolation, for finding roots, 197 
of iteration for finding roots, 208, 219, 
297 

for solving difference equations, 396 
for solving simultaneous linear equa- 
tions, 297 

of least squares, 533, 543 
of Milne, for solving differential equa- 
tions, 353 

of Picard, for solving differential equa- 
tions, 316 


of relaxation, 410 
of Runge and Kutta, 359
of selected points for finding constants
in empirical formulas, 522 
pivotal, of evaluating determinants, 261 
Rayleigh-Ritz, 422 

Stormer-Milne, for special second-order 
equations, 344 

triangular, of evaluating determinants, 
263 

Methods of solving partial differential 
equations, 391-436 

of solving simultaneous linear equa- 
tions, 260-302 

of starting solutions of differential 
equations, 328, 329, 375 
Miller, F. H., 423 
Milne, W. E., 329, 353
Milne’s method for solving differential 
equations, 353 

formulas for starting the solution of a 
differential equation, 330 
formulas for solving differential equa- 
tions, 354 
Mistakes, 460 

Modulus of complex roots, theorem relating
to, 241
Montel, P., 102 
Moors, B. P., 156, 192
Moulton, F. R., 327, 375 
Multiplication, accuracy of, 14 

N 

Negligible effects, criterion for, 514 
Networks, triangular, 415 
Newton, I., 201 

Newton-Raphson method of solving equa- 
tions, 201 

convergence of, 212 
for simultaneous equations, 215 
geometric significance of, 203 
inherent error in, 205
Newton’s formula (II) for backward 
interpolation, 59 

(I) for forward interpolation, 56 
Non-linear empirical formulas, 545 
integral equations, 450 
Normal equations, 535, 543 
rule for writing down, 535, 543 
Normal probability curve, 462, 468 
Nth derivative, table of, 38 
Numbers, approximate, 2 
rounded, 2

Numerical differentiation, 133 





Numerical integration, 136-182 
by central-difference quadrature formu- 
las, 145, 148 
by Euler’s formula, 165 
by Gauss’s formula, 153 
by Lobatto’s formula, 160 
by Simpson’s Rule, 137 
by Tchebycheff’s formula, 162
by Weddle’s Rule, 138 
Numerical solution of ordinary differen- 
tial equations, advantages and 
disadvantages of, 370 
by approximating polynomials, 320
by Euler’s method, 310 
by Milne’s method, 353 
by Picard’s method, 316 
by Runge-Kutta method, 359 
starting the, 328, 375 
Numerical solution of partial differential 
equations, 
by iteration, 396 
by relaxation, 410 
by Rayleigh-Ritz method, 422 
Nyström, E. J., 438

O

Observations, rejection of, 519
weighted, 482 
Overrelaxation, 415 

P 

Palmer, A. de F., 26, 514, 515 
Partial derivatives of tabulated func- 
tions, 135 
Pearson, K., 130 
Percentage error, 4 
probable error, 512 
Periods other than 2π, 579
Picard’s method, 316
Pivotal element, 261 
equation, 269, 272 

method of evaluating determinants, 
261-263 

Plotting, method of repeated, 199
Points, lattice, 392 
method of selected, 522 
Poisson’s equation, 395 
Polynomial, differences of a, 54 
when nth differences are constant, 556 
Polynomials, approximating, 320, 321 
Pope, Alexander, 40 

Postmultiplication and premultiplication 
of matrices, 283 

Powers and roots, accuracy of, 18 


Practical astronomy, fundamental equa-
tions of, 43-44

Precision and accuracy, distinction be-
tween, 493
index for errors, 468
for residuals, 488 
measures, 494 

computation of, from residuals, 500, 501

geometric significance of, 497 
relations between, 496 
Principle of equal effects, 26 
of Graeffe’s method, 225 
of least squares, 481, 484 
Prismatoid, definition of, 179 
Prismoid, definition of, 177 
Prismoidal formula, 178 
Prismoids and prismatoids, distinction
between, 179 

Probability equation, for errors, 468 
for residuals, 488 
curve, normal, 462 
integral, evaluation of, 473 
tables of, 582 

of errors lying between given limits, 
462 

of hitting a target, 476 
Probable error, computation of, from 
residuals, 500 
definition of, 495 
formulas for, 500, 503, 511, 512 
in indirect measurements, 511, 512 
meaning of, 509 

of arithmetic and weighted means, 500 
of a function whose p.e.’s are known, 
51 

Probable error and weight, relation be-
tween, 499, 500
Probable percentage error, 512 
relative error, 511, 512 
Product, accuracy of, 14 
relative error of, 14 
Propagation of errors, 511 

Q 

Quadrature, mechanical, 136 
formulas, caution in use of, 168 
central difference, 144, 148 
Euler’s, 165 
Gauss’s, 152 

general, for equidistant ordinates, 136 

Lobatto’s, 160 

Simpson’s, 137

Tchebycheff’s, 162 

Weddle’s, 138 





Quotient, accuracy of, 16 
relative error of, 15 
Quotients, difference, 392 

R 

Rainbow, R., 232 
Rainbow’s check formula, 233 
Range, of projectile, 389
Raphson, 201

Rayleigh, Lord (J. W. Strutt), 423 
Rayleigh-Ritz method, 422
Reciprocals of roots, relations between 
coefficients and, 237 
Regula falsi method, 197
Rejection of observations and measure- 
ments, rule for, 519 
Relation between 
differences and derivatives, 54 
precision measures, 497 
probable error and weight, 499 
roots and coefficients, 228, 237 
Relaxation, block, 416 
method of, 410 
Relative error, 4 

and significant figures, relation be-
tween, 4

theorems concerning, 5-8 
of a product, 14 
of a quotient, 15 
probable, 511, 512 
Remainder term 

in Bessel’s formulas, 100, 108 
in binomial series, 35 
in central-difference quadrature formu- 
las, 189 

in Euler’s formula, 192
in formula for double interpolation, 
124 

in formula for interpolating to halves, 
108 

in Gauss’s formula, 191 
in Lagrange's formula, 103, 108 
in linear interpolation, 112 
in Newton’s formula (I), 102, 107 
in Newton’s formula (II), 104, 106, 107 
in Simpson’s Rule, 183-187 
in Stirling’s formula, 105-108 
in Weddle’s Rule, 189 
Repeated plotting, method of, 199 
Residual, 410, 484, 528 
equations, 529, 534 
Residuals, 484, 528 

in terms of errors, 489, 490 
law of error for, 487 
of measurements, 524 


of plotted points, 528 
probability equation for, 488 
sum of, theorem concerning, 486
weighted, 541-543 
Rice, H. L., 54, 50 
Richardson, L. F., 3^ 

Ritz, W., 423 
Rockets, 379 

Roots, complex, detection of, 234 
computation of by interpolation, 197 
by iteration, 208, 219, 297 
by Newton-Raphson method, 201, 215 
by regula falsi method, 197 
by repeated plotting, 199 
errors in, due to errors in coefficients, 
213 

finding approximate values of, 194 
Graeffe’s method for finding, 225 
location of, 194 
real and equal, 243 
Root-squaring process, 226 
principle of, 225 
rule for applying, 228 
when to discontinue, 229-230 
Rounding of numbers, rule for, 3 
Runge, C., 358, 556 
Runge and König, 360, 566
Runge-Kutta method, 333, 358 
for first-order equations, 359 
for second-order equations, 361
for simultaneous equations, 362 
inherent error in, 360 
special case of, 359 

S 

Scheme for 12 ordinates, 564
for 24 ordinates, 574 
Semi-logarithmic paper, 526 
Series, alternating, error in, 34
asymptotic, 166, 475 
binomial, remainder term of, 35 
exponential, remainder term in, 36 
interpolation, 102 
logarithmic, remainder term in, 37 
Maclaurin’s, remainder term in, 33
Taylor’s, remainder term in, 32, 33 
Series approximations, accuracy of, 32 
Sets of measurements, combination of 
when p.e.’s are given, 503, 504
Shortley, G. H., 391 
Significant figures, 2 
in powers, roots, logs, and antilogs, 21 
in products and quotients, 21 
loss of, by subtraction, 12 
relation of, to relative error, 6-8 





Simpson’s Rule, 137 
formulas for inherent error in, 183-187 
geometric significance of, 138
Simultaneous equations, 
algebraic and transcendental, 215
differential, 349 

linear, accuracy of solutions of, 303 
solution of, 
by determinants, 268 
by division by leading coefficients, 
269 

by Gauss’s method, 272, 274
by inversion of matrices, 296 
by iteration, 297 
Smeal, G., 245 
Smithsonian tables, 132 
Southwell, R. V., 391, 422 
Special equations of second order, 342 
Special procedure for algebraic equations, 
207 

Standard deviation, 499 
Starting a solution, methods of, 328 
Starting values, methods of finding, 375
Steffensen, J. F., 38, 102 
Stirling’s formula of interpolation, 82 
as a power series, 98 
compared with Bessel’s, 89 
when to use, 89, 109 
Störmer, C., 344
Störmer’s formula, 344
String, vibrating, 423 
Strutt, J. W., 423 
Subtraction, accuracy of, 12, 13 
loss of leading significant figures by, 12 
Systematic errors, 460 
Systems of algebraic and transcendental 
equations, 215 

of differential equations, 349 
of linear algebraic equations, 259-309 
accuracy of solution of, 303 
of special second-order equations, 348 

T 

Table, function, 114 
Tables of differences, 48-50 
of probability integral, 582-586 
Tabular value, effect of error in, 52 
Tannery, J., 2 

Target, probability of hitting, 476 
Taylor’s formula, remainder term in, 32, 
33 

Tchebycheff, P., 162, 185, 192 


Tchebycheff’s formula, 162 
derivation of, 163 

Theory and experience, agreement be- 
tween, 491 

Todhunter, I., 154, 191 
Transformation, conformal, 407 
Trapezoidal Rule, 142 
Traverse, 397 

Triangular determinant, 263 

method of evaluating determinants, 
263, 264 
networks, 415 

Trigonometric interpolation, 130 
series, case of 12 ordinates, 564 
case of 24 ordinates, 574 
Two-way differences, 121, 122 

U 

Uytenbogaart, J. W. H., 379

V 

Vallée Poussin, C. J. de la, 54, 165, 184,
398 

Value of h for stipulated accuracy in 
integral, 188 
Van Orstrand, C. E., 544 
Vibrating membrane, 430 
string, 423 

W 

Weddle’s Rule, 138 
geometric significance of, 139 
inherent error in, 189 
Weierstrass, K., 47 
two theorems of, 47 
Weight, definition of, 483 
of a function, 542 

Weight and probable error, relation be- 
tween, 499, 500 
Weighted mean, 485 
normal equations, rule for writing 
down, 543 
observations, 483 
residuals, 541-543 
Weller, R., 391 
Wentworth, G. A., 530 
Whittaker and Robinson, 114, 165 
Whitworth, W. A., 530 
Willers, F. A., 360, 391 
Wilson, E. B., 177, 398 



ANSWERS TO EXERCISES 


I, Page 40

1. 63.85, 93490 or 9349 X 10, 0.006394, 

83620 or 8362 X 10, 3630 X 10*, 0.09004, 

63910 or 5391 X 10. 

2. Beam measurement. 3. 571. 4. 5529 or 5528. 

5. 6804.0 or 6804.1. 6. 0.0206850. 

7. Between 860 and 865. 8. 9 ft./sec. 

9. 37.1 ± 2.8 ft./sec. 

10. 6,250,000 ft. lbs.; 6,211,180 ft. lbs.; 

6,216,972 ft. lbs. 

11. y =0.00025, ^ =0.000125. 

12. 100 — = 0.177 ; 100 ~ = 0.98. 

r ft 

13. Atf = 0.00079458 radian = 2' 43". 

14. 0.1%. 15. dL = l'0.24" 

16. dft = 0'.6, di = 0.045234 radian = 10™ 22'». 

17. dLx — 18".9, dL* = 45".4, dAx = 30".6, dAj = 30".6. 

18. For A = 10°, dL = 1' 21"; 
foril=80°, dL = 16'37". 

19. (a) dA = 29".5, d< = 14*.8; 

(b) dh = 7".8, di = 0*.7. 

20. For A = 10°, dt = 145*.2 = 2"* 25».2 ; 
for A = 80°, di — 9'.9. 

21. For A = 10°, dL = 3".0, dh = 3".0 ; 

for A = 80°, dL = V 37".8, dh = 17".0. 

22. For A = 10°, dL = IM ; 
for A = 80°, dL = 9'.0. 

23. Al is most potent as t increases. 

24. 0.77210. 25. 1.5061.




86 . 


dk< 


sin sin \(tx — ta)vo»h^ 


(dk + 


sin ^{h, — htf 
sin |(Ii — I,) 


dt). 


87. VA. 


II. Page 63 

1. 65540 should be corrected to 65536. 

2. Fourth line should be 19° 51' 59".8.

3. 8.0363956 − 10. 4. 261° 54' 14".7.

5. 8.0891991 − 10. 6. 274° 43' 22".

III, Page 77 

1. y = 0.77729893. 

2. p = 177.4.

3. y = 0.675828379. 

4. y = 16.79; x = 97.66.


IV, Page 90 

1. 8.2175401 — 10. 2. 0.691960629. 

3. 0.6448325. 4. 1' 53".9. 5. 12° 55' 12".94.

6. 0.436185128.


V, Page 101 
0.73811340. 

VI, Page 113 

1. Ex. 3, /tn = 0.030 × 0.0000001; Ex. 4, = 0".023.

2. Ex. 2, A. = 0.016 × 0.000000001; Ex. 3, A» = 0.11 × 0.0000001;
Ex. 4, A, = 0".0012; Ex. 6, A. = 0.066 × 0.000000001.


VII, Page 132

1. 6h 48m 25s.





VIII, Page 180 

1. 5.6972. 2. 0.4623. 3. 6h 5m 21s.9 A.M., June 22.

4. By Simpson’s Rule, 1.505103;
by Weddle’s Rule, 1.505103. 

5. 0.92703608. 6. 293.4. 7. 1.0101996.

8. 0.90452. 9. 0.113822. 10. —0.09485. 

11. 0.9480, by Simpson’s Rule. 12. 0.13340.

13. (a) 0.3585; (b) 0.3201; (c) 0.3104; (d) 0.2444.

IX, Page 193 

1. The result 1.505103 is correct to its last digit in both cases.
8. ^, = — 0.0207. 

3. The result 0.904.*) I is correct to its last digit. 

X, Page 223 

3. 3.7893. 4. 6.1647. 5. 2.883238. 6. 0.12213.

7. 1.723. 8. 0.93825. 9. 0.15368. 

10. 2.138, - 1.069 ± 2.257i. 11. 1.0649. 

12. 1.44575. 13. x = 0.22684, y = 0.36962.

14. x = 0.567325, y = 1.857378.

15. ± 0.20292, ± 0.37077, ± 0.47455. 

16. ± 0.65034.


XI, Page 259

1. 3.5616 ± 2.4931i, 0.0284 ± 0.3241i.

2. 0.06710 ± 1.351i, −0.003604 ± 0.9628i.


3. 1.06393, −1.31327,
0.17571 ± 1.37214i, −0.051040 ± 0.94212i.


4. 5.2555, 0.9676 ± 0.3272i,
0.7870 ± 0.5764i, 0.3833.

5. −1.5818592

0.5664427 ± 1.3686572i
0.9623278 ± 0.7135689i
−0.7378408 ± 0.8124513i.


9. −5.6248

−1.1212 ± 0.9752i

−0.7446 ± 1.2417i

−0.2697 ± 0.9600i
2.1748.





XII, Page 307 

1. 239. 2. 239. 3. x = 3. 4. x₁ = 0.8072, x₂ = 0.2372, x₃ = −0.1046,
x₄ = −0.3581. 5. 79.98. 6. x = 2, y = −1, z = 3, t = 6. 8. x = 0.444,
y = 0.563, z = 0.324, t = 0.723.

9. x = 1.7087, Δx = 0.1044

y = −1.8003, Δy = 0.1152

z = 1.0491, Δz = 0.1868.

10. Ax = 0.1248, Ay = 0.0116, A« = 0.0933, Att = 0.0295. 


XVI, Page 492 

1. 0.3213885. 

8. 0.8846. 

3. 0.331. 

4. 0.999978. 

5. 0.00115; 870. 

6. 364.14. 

7. 370.22; Σwv = 0.16; Σwv² = 15.0992.

XVII, Page 509 

1. r = 0.0071; M₀ = 9.6639 ± 0.0022.

2. r = 1".455; M₀ = 116° 43' 49".696 ± 0".420.

3. if, = (3.6462 ± 0.0012) X lO**. 

4. M₀ = 0.00356 ± 0.00016.

5. M₀ = 36° 41' 24".04 ± 2".70.

6. M₀ = 299937 ± 44.

7. M₀ = 107.9374 ± 0.0027.


XVIII, Page 520 

1. A = 81° 12' ± 1'.8; a = 114.71 ± 0.18.

2. c = 162.41 ± 0.04.

3. ft = 1.37328 ± 0.00013.

4. 7 = 1.3745 ±0.0053. 





5. V z= 6s!d.2 dz 4.5 cubic centimeters. 

6. A = 0.19642 ± 0.00003.


XIX, Page 562 


1. y = 0.0000008679 j:»'*''\ 

3. y = 94.87 + 5.01321*. 

4. y = 481.6 j-‘**“. 

5. y=113.4c-®‘***-'. 

6. y = 112.25e-® 

7. y = — 0.0017500 + 0.0030335**. 

« 

g * 

^ ^ ~ 0.17645 + 0.055890* ' 


9. y = 


* 

0.18 4- 0.056* ■ 


10. y = 98.9 4 5.017**. 

11. y = 11.100 + 2.7516* — 0.70952** -|- 0.18342**. 


12 . 


IS. 


— ^ = 0.0152778 + 0.0014414*. 
y — 20 

58.275 

" ” 0.010133 4- tan * ’ 


14. Smoothed y’s are: 

1.80, 2.53, 3.21, 3.84, 4.44, 5.04, 5.66, 6.29, 6.90, 7.51, 8.11, 8.72, 9.32,
9.93, 10.53, 11.14, 11.76, 12.40, 13.04, 13.70.


15. Smoothed y’s: 

13.5, 11.0, 9.55, 8.80, 8.45, 8.39, 8.58, 8.84, 9.14, 9.38, 9.52, 9.53, 9.39,
9.04, 8.43, 7.60, 6.70.


XX, Page 581 

1. y = 4.52 + 28.04 cos x − 2.37 cos 2x + 0.90 cos 3x + 4.85 cos 4x
+ 1.71 cos 5x + 0.74 cos 6x − 0.13 sin x − 12.60 sin 2x
− 3.16 sin 3x + 3.41 sin 4x − 0.22 sin 5x.





2. y = 2.60 + 90.76 cos x + 2.39 cos 2x 32.32 cos 3x − 6.29 cos 4x
+ 3.63 cos 5x + 3.42 cos 6x − 0.01 cos 7x − 6.88 cos 8x
− 2.51 cos 9x − 0.06 cos 10x − 2.56 cos 11x − 7.08 cos 12x
+ 40.99 sin x + 3.56 sin 2x + 18.15 sin 3x + 3.68 sin 4x
+ 3.60 sin 5x + 9.92 sin 6x + 1.95 sin 7x − 0.65 sin 8x
− 3.18 sin 9x − 1.64 sin 10x − 0.20 sin 11x.

4. y = 13.579 — 6.740 cos (^) — 1.760 cos 2(^) 

— 0.499 cos 3 (:^ ) — 1.092 coa4 ( ^) — 0.760 cos 6 ( ^ ) 

14.4 14.4 14.4 

— 0.367 cos 6 ( :^) — 0.288 cos 7 ( ^) — 0.242 oos 8 ( 

14.4 14.4 14.4 

— 0.134 cos 9 ( ^ )- 0.133 cos 10 ( ^ )- 0.129 cos 11 (^ ) 

— 0.046 cos 12 (— 7 ) — 4.035 sin(—- 7 ) — 0.093 sin 2 ( 73 - 7 ) 

14.4 14.4 '14.4' 

— 0.081 sin 3 ( ^ ) — 0.404 sin 4 ( ^ ) 4 - 0.268 sin 6 ( ^ ) 

+ 0.392 sin 6 ( + 0.293 sin 7( ^ + 0.173 am 8 ( ^ ) 

— 0.048 sin 9( + 0.109 sin 10( + 0.013 sin 11 (^) 

14.4 14.4 14.4 



subjects covered are accuracy of approximate
calculations, interpolation, numerical differ-
entiation and integration, numerical solution
of algebraic and transcendental equations,
numerical solution of differential equations,
theory of errors, precision of measurements,
and empirical formulas. All subjects and
topics are illustrated by numerical examples
worked out in detail.

Numerical Mathematical Analysis has been
adopted as a text in hundreds of colleges
and universities. The excellence of this new
edition will make it known still more widely.


JAMES B. SCARBOROUGH began a long
career in 1914 on the faculty of North
Carolina University and then transferred to
the U. S. Naval Academy in 1918. He has
been Professor Emeritus of Mathematics at
the latter institution since 1950, the year in
which the second edition of Numerical Mathe-
matical Analysis was published. Born in
Mount Gilead, North Carolina, the distingui-
shed mathematician took his A. B. and A. M.
degrees at Chapel Hill. He was awarded
the Ph. D. by the Johns Hopkins University.
In addition to writing the textbook that has
become the classic in its field, Dr. Scarbo-
rough is the author of The Gyroscope: Theory
and Applications, and Differential Equations
and Applications. He co-authored Fundamen-
tals of Statistics and contributed to the
Handbook of Engineering Mechanics.



Oxford's Low Priced Textbooks 


EXPERIMENTAL DESIGN Walter T. Federer 

This text presents the subject matter and techniques of experimental 
design not available in other texts. A comprehensive coverage of the 
design and the analysis of experiments is given. 

591 pages Rs. 15.00 

ELEMENTS OF THE DIFFERENTIAL AND INTEGRAL CALCULUS

Granville : Smith : Longley
This new revised edition retains the features on Hyperbolic Functions 
and cylindrical coordinates employed to broaden the applications of 
double integrations. 

566 pages Second Indian Reprint Rs. 10.00 

PLANE TRIGONOMETRY AND TABLES Granville : Smith &

Mikesh 

In revising Granville’s Plane Trigonometry, the authors have kept in 
mind the features which have made the Granville textbooks popular with 
teachers and students, namely, simplicity and clearness of exposition, 
an abundance of examples worked out in the text and a large variety of 
problems covering a wide range of applications. 

270 pages Rs. 8.00 

A SHORT TABLE OF INTEGRALS B. O. Peirce & R. M. Foster
Users of these tables must be familiar with the standard forms of the
indefinite integral of the principal elementary functions, as well as with
the elementary operations involving indefinite integrals.
189 pages Rs. 6.00 


OXFORD & IBH PUBLISHING CO. 

Calcutta • Bombay • New Delhi 



